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PREFACE 


Finding  and  fixing  software  errors  accounts  for  much  of  the  cost  of  software  development  and 
maintenance.  When  one  includes  the  cost  of  inspections,  testing,  and  rework,  as  much  as  half  or  more 
of  the  typical  development  cost  is  spent  in  detecting  and  removing  errors  (Humphrey  1989). 

The  Software  Error  Estimate  Program  (SWEEP)  aids  you  in  the  management  and  prediction  of  errors 
and  defects  in  software-intensive  systems.  It  supports  you  in  establishing  goals  for  error  detection 
during  software  development  and  can  help  to  track  progress  against  those  goals.  You  can  also  use 
SWEEP  to  monitor  and  help  control  the  quality  of  software  products  by  predicting  the  number  of 
defects  remaining  in  your  software  system — the  latent  error  content.  You  use  the  latent  error  content 
to  estimate  the  amount  of  testing  and  repair  effort  needed  to  reach  different  quality  levels  and  to 
understand  the  quality  of  a  delivered  product.  SWEEP  can  also  help  you  project  software  outages  due 
to  software-related  defects  in  software-intensive  systems. 

SWEEP  supports  a  variety  of  organizational  approaches  to  monitoring  defects  during  software 
development.  Organizations  that  measure  and  track  defects  during  the  software  implementation 
phase  can  cycle  through  development  and  test  intervals,  using  SWEEP  to  predict  the  latent  error 
content  Organizations  that  measure  and  track  defects  throughout  the  life  cycle  can  use  SWEEP  to 
predict  the  latent  error  content  in  one  phase,  based  on  the  data  collected  in  previous  phases.  SWEEP 
can  also  assist  you  in  the  planning  of  software  projects.  You  enter  into  SWEEP  the  number  of  defects 
(per  code  unit)  and  the  estimated  latent  error  content  to  derive  a  profile  pattern  for  error  discovery 
that  will  project  the  number  of  remaining  errors  on  a  phase-by-phase  basis. 

SWEEP  is  applicable  to  a  wide  variety  of  development  efforts,  including  software  products, 
software-intensive  systems,  and  systems  engineering  projects.  SWEEP  is  independent  of  development 
language,  methodology,  or  platform.  You  can  use  SWEEP  to  predict  latent  errors  in  virtually  any 
software  product  including: 

•  Real-time  embedded  software  systems 

•  Command,  communication,  and  control  information  systems 

•  Data  and  information  management  systems 

•  Software  development  support  tools 

SWEEP  is  intended  for  use  by  a  wide  range  of  people  and  organizations  involved  in  the  development 
effort  Project  managers  can  use  SWEEP  to  aid  in  the  scheduling  and  rescheduling  of  project  tasks, 
based  on  latent  error  content  and  quality  goals.  Senior  line  engineers  can  use  SWEEP  to  trade  the 
number  of  errors  in  code  segments  to  determine  when  they  have  finished  unit  and  integration  testing 
efforts.  Software  quality  managers  and  test  and  verification  engineers  can  use  SWEEP  to  estimate  the 


Preface 


time  needed  to  test  code  segments  in  order  to  help  determine  when  *hey  have  completed  sufficient 
testing. 

Organizations  can  apply  SWEEP  during  the  design,  development,  and  maintenance  portions  of  the 
software  development  cycle,  regardless  of  whether  the  process  is  proprietary,  a  classic  waterfall 
approach,  or  an  evolutionary  spiral  approach.  SWEEP  can  benefit  project  and  program  offices  by 
aiding  in  the  planning,  management,  and  development  of  their  products.  Centralized  engineering 
organizations  can  use  SWEEP  to  collect  and  monitor  data,  and  subsequently  to  enact  organizational 
process  improvements  based  on  that  data.  SWEEP  can  be  an  integral  part  of  a  systematic  plan  to 
improve  continuously  the  quality,  development,  and  management  of  deliverables. 

Organizations  can  introduce  SWEEP  into  a  current  development  effort  at  various  levels  of 
involvement.  At  a  minimal  level,  organizations  can  conduct  parallel  development  efforts  to  collect 
data  in  a  nonintrusive  manner.  You  can  use  this  shadow  data  to  assess  the  impact  that  SWEEP  and 
error  prediction  will  have  on  your  organization’s  process.  At  a  more  involved  level,  you  record  the 
number  of  defects  found  during  test  intervals  and  then  use  SWEEP  to  estimate  the  number  of 
remaining  errors.  At  the  maximum  or  most  involved  level,  you  can  use  SWEEP  as  a  planning  aid  to 
estimate  latent  defects  in  various  phases  of  development. 
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1.  INTRODUCTION 


1.1  BACKGROUND 

One  of  the  difficult  challenges  in  today’s  software  development  environment  is  controlling  and 
managing  software  errors.  An  error  is  defined  as  a  detected  failure;  a  failure  is  defined  as  a  manifesta¬ 
tion  of  the  software  product’s  inability  to  conform  to  requirements.  You  can  also  refer  to  errors  as  de¬ 
fects;  many  people  and  projects  use  the  terms  interchangeably.  SWEEP  was  developed  to  aid  with  the 
control  and  management  of  error  data  for  software  projects.  It  allows  you  to  make  software  error  esti¬ 
mates  that  are  useful  in  setting  project  goals  and  monitoring  the  quality  of  the  software  as  it  is  devel¬ 
oped  by  your  organization.  SWEEP  uses  software  error  data  available  early  in  the  development  cycle 
to  project  both  the  number  of  errors  that  will  be  found  in  later  stages  of  development  and  the  errors 
contained  in  the  software  when  it  is  delivered  (latent  error  content).  Latent  error  content  is  expressed 
as  an  error  density;  however,  if  the  user  knows  the  size  of  the  software  code,  the  latent  error  content 
can  easily  be  converted  to  the  number  of  errors  at  delivery. 

12  BENEFITS 

SWEEP  is  useful  to  anyone  involved  with  metrics  (data  collection  and  monitoring)  or  verification 
activities  within  a  software  project’s  software  life  cycle.  SWEEP  can  be  used  to  make  cost-effective 
decisions  related  to: 

•  Assessing  the  effectiveness  of  the  verification  processes  based  on  the  error  discovery  profile 
generated. 

•  Making  the  potential  process  adjustments  to  remove  errors  earlier,  to  minimize  software 
rework,  and  to  save  time  and  money.  This  manual  does  not  explicitly  address  cost  and  schedule 
impacts  associated  with  software  errors. 

•  Using  the  knowledge  gained  from  inspections  to  predict  the  probable  error  content  of  the 
software  both  in  later  development  phases  and  when  the  product  ships. 

•  Estimating  the  rate  at  which  defects  will  be  discovered  in  delivered  software  products. 

•  Comparing  various  software  products’  error  discovery  histories  that  might  not  otherwise  be 
possible  if  only  time-based  error  data  were  available. 

•  Planning  the  software  development  process  with  respect  to  goal  setting  for  error  discovery. 

•  Estimating  the  number  of  outages  to  be  expected  for  complete  systems,  not  just  for  hardware 
components  (Gaffney,  Ferrando,  and  Jones  1993). 
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One  of  the  main  features  of  SWEEP  is  its  ability  to  predict  latent  error  content.  This  capability  is 
useful  in  making  decisions  related  to  software  maintenance  and  reliability.  SWEEP  offers  a  time- 
based  model,  a  phase-based  model,  and  a  planning  aid  The  time-based  model  allows  you  to  estimate 
and  track  errors  during  system  test  and  integration  cydes.  The  phase-based  model  allows  you  to  pre¬ 
dict  and  track  defects  for  multiple  phases  and  can  provide  defect  information  before  you  execute  any 
code.  This  allows  more  time  for  a  project  to  make  software  process  adjustments  and  to  enhance  the 
cost-effectiveness  of  the  projects’  decisions.  The  planning  aid  enables  an  engineer  to  set  error  discov¬ 
ery  objectives  in  a  software  project  based  on  experience  from  previous  projects.  The  planning-aid 
model  generates  an  error  discovery  profile  based  on  the  corporate  data  to  guide  the  software  project 
toward  those  objectives. 

The  phase-based  model  approach  evolved  from  work  done  at  IBM  (Gaffney  1984)  and  has  been  used 
there  on  large  software  projects.  An  aerospace  company  has  successfully  applied  the  SWEEP 
phase-based  algorithm  to  project  data.  The  SWEEP  time-based  model  algorithm  was  validated  by  the 
Consortium  on  a  large  program. 

13  AUDIENCE  AND  PURPOSE 

SWEEP’S  intended  audience  includes  senior  line  engineers,  project  managers,  software  quality 
managers,  verification/test  managers,  and  technologists  who  are  interested  in  estimating,  monitoring, 
and  managing  the  quality  of  software  under  development.  SWEEP  may  be  useful  for  making  two  types 
of  decisions: 

•  How  to  improve  the  software  process,  based  on  the  data  generated 

•  How  that  data  measures  up  to  quality  goals  for  a  given  software  project 
SWEEP  is  a  tool  that  enables  organizations  to: 

•  Establish  appropriate  goals  for  error  discovery 

•  Estimate  the  e.  or  content  of  software  earlier  in  the  life  cycle  than  is  currently  possible  using 
existing  techniques 

1.4  SWEEP  UPGRADE 

The  SWEEP  tool  has  been  enhanced  to  allow  you  to  create  and  manipulate  activity  sets,  organize  data 
into  groups,  and  customize  as  many  as  nine  life-cycle  phases.  In  addition,  improvements  have  been 
made  to  the  curve  fitting  algorithm  and  to  the  presentation  of  the  incidence  and  cumulative  curves. 

13  MANUAL  ORGANIZATION 

This  manual  describes  how  to  use  SWEEP.  The  manual  is  organized  as  follows: 

•  Section  2,  Using  SWEEP  within  Your  Software  Process,  describes  how  to  introduce  SWEEP 
into  your  development  life  cycle.  It  also  identifies  how  software  projects  can  use  the  three 
SWEEP  modes  both  to  evaluate  better  the  effectiveness  and  efficiency  of  their  current 
development  process  and  to  provide  a  means  for  initiating  process  improvements. 
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•  Section  3,  Invoking  SWEEP,  describes  how  to  install  and  invoke  SWEEP  on  both  an 
IBM-compatible  PC  and  an  Apple  Macintosh  computer. 

•  Section  4,  Operating  SWEEP,  describes  the  operation  of  Mode  1,  the  time-based  mode  1; 
Mode  %  the  phase-based  mode  1;  and  Mode  3,  the  planning  aid.  The  description  covers  the 
menu  bar  selections,  dialog  boxes,  and  windows  presented  during  operation. 

•  Section  5,  Case  Study,  demonstrates  the  use  of  SWEEP  with  real  error  data  from  a  major 
software  development  project. 

•  Appendix  A  describes  the  mathematical  models  used  within  SWEEP 

1.6  TYPOGRAPHIC  CONVENTIONS 

This  manual  uses  the  following  typographic  conventions: 


Serif  font .  General  presentation  of  information. 

Boldfaced  serif  font  .  Section  headings  and  emphasis. 

[  ]  .  Screen  buttons. 

<  >  .  Keyboardkeynames,suchas<RETURN>fortheRetum 

key. 
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2.  USING  SWEEP  WITHIN  YOUR  SOFTWARE 

PROCESS 

2.1  MOTIVATION  FOR  USING  SWEEP 

Undetected  defects  in  delivered  software-intensive  systems  can  cause  disastrous  failures.  Since  the 
cost  of  correcting  software  errors  increases  substantially  in  later  development  stages,  it  is  important 
to  be  able  to  estimate  how  many  defects  exist  in  a  software  program  and  to  correct  them  as  early  in 
the  development  process  as  possible. 

SWEEP  can  help  you  manage  your  software  development  process  better.  You  can  use  SWEEP  as  an 
aid  to  establish  quality  objectives  and  to  determine  if  you  are  meeting  them.  You  can  employ  SWEEP 
to  assist  you  in  meeting  an  availability  or  other  defect-related  quality  objective  established  for  the  soft¬ 
ware  product  you  are  developing.  SWEEP  will  provide  you  with  some  of  the  attributes  required  for 
processing  software  defect-related  data  that  your  software  development  organization  must  have  for 
certification  at  the  higher  levels  of  the  Software  Engineering  Institute  (SEI)  software  process  maturity 
model.  For  example,  the  ability  to  make  software  defect  projections  is  one  of  the  requirements  that 
your  organization  must  meet  to  attain  higher  software  process  maturity  levels  on  the  scale  developed 
by  the  SEI  (Paulk  et  al.  1993). 

SWEEP  will  assist  you  in  making  software  defect  projections;  it  can  estimate  and  predict  latent  defect 
content  According  to  the  SEI  Capability  Maturity  Model  (CMM),  a  Level  1  organization  typically 
detects  and  removes  defects  instead  of  imposing  a  process  focused  on  preventing  defects  (Paulk  et  al. 
1993).  SWEEP  can  help  you  decide  how  to  improve  your  software  process  by  identifying  where  in  your 
process  defects  will  occur.  As  software  organizations  move  towards  higher  maturity  levels,  they  begin 
to  practice  defect  prevention.  The  fundamental  objective  of  software  defect  prevention  is  to  detect 
defects  as  early  as  possible  in  the  development  life  cycle. 

On  the  basis  of  current  SEI  statistics  (characterized  by  the  number  of  organizations  at  Level  1  versus 
Level  4),  most  organizations  concentrate  on  defect  detection  rather  than  defect  prevention  (Kitson 
et  al.  1993).  Using  SWEEP  will  help  you  to  manage  software  defects  throughout  the  software  develop¬ 
ment  process  and  to  create  a  product  whose  characteristics  are  more  predictable.  You  will  be  able  to 
use  systematically  data  obtained  during  inspections  to  predict  better  the  quality  of  your  software.  This 
will  provide  you  with  the  necessary  information  to  take  corrective  action  earlier  ( and  more  cost-effec¬ 
tively)  than  if  you  were  to  wait  until  the  system  was  operational.  SWEEP  enables  you  to  achieve  the 
availability  objectives  established  for  the  software  you  are  building. 

2.1.1  SWEEP  Capabilities 

The  SWEEP  tool  consists  of  three  modes  to  enable  you  to  predict  and  track  the  discovery  of  defects 
and  to  predict  the  latent  defect  content  of  software  products.  SWEEP  enables  organizations  or 
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development  efforts  that  rely  on  system  and  integration  testing  for  identification  and  repair  of  defects, 
to  predict  the  number  of  defects  that  remain  in  the  software  system  after  each  successive  test  interval. 
SWEEP  helps  organizations  that  use  design  reviews  and  verification  activities  to  predict  the  number 
of  defects  for  each  phase  in  a  development  life  cycle.  Finally,  SWEEP  provides  to  organizations  that 
build  similar  software  systems  and  maintain  a  history  of  their  defect  discoveries,  help  in  the  planning 
and  establishment  of  software  quality  goals. 

SWEEP’S  Modes  1  and  2  enable  you  to  use  data  obtained  during  inspections  and  other  verification 
activities  to  determine  if  the  defect  discovery  goals  you  have  established  for  each  stage  of  the  develop¬ 
ment  process  are  being  satisfied.  If  you  discover  significantly  more  defects  than  you  had  projected,  you 
can  correct  your  software  earlier  in  the  process.  SWEEP  Mode  3  includes  a  capability  to  help  you  es¬ 
tablish  goals  for  defect  discovery  during  the  software  development  process.  You  can  use  SWEEP  to 
compare  your  product’s  defect  history  with  that  of  other  products,  to  decide  how  to  improve  your  soft¬ 
ware  process  based  on  how  defects  are  being  detected,  and  to  evaluate  software  quality  as  the  product 
is  being  developed. 

In  order  for  SWEEP  to  be  effective,  a  project  must  apply  SWEEP  consistently  over  the  development 
cycle — at  the  level  of  computer  software  units  (CSU),  computer  software  components  (CSC),  or  com¬ 
puter  software  configuration  items  (CSCI).  For  example,  if  you  collect  and  enter  defect  data  at  the 
CSC  level  during  preliminary  design  review,  you  must  also  collect  and  enter  the  defect  data  at  the  CSC 
level  during  detailed  design  review. 

Your  organization  may  use  a  number  of  inspection  or  verification  activities  during  each  of  the  phases 
of  the  development  process.  You  may  have  defect  discovery  data  from  several  inspections  for  detailed 
design,  for  example.  If  you  assume  that  defects  are  evenly  distributed  across  your  design,  then  you  can 
average  the  defect  densities  found  for  each  design  unit  into  one  defect  density.  You  can  then  enter  this 
average  defect  density  number  into  SWEEP. 

You  can  gather  defect  data  from  several  sources,  including  software  problem  reports,  system  error 
logs,  code  inspections,  verifications,  etc.  Defects  are  usually  tracked  as  errors  per  thousand  source 
lines  of  code  KSLOC,  but  your  project  may  use  any  measurement  (e.g.,  modules,  function  points). 

2.1.I.1  Assumptions  Related  to  the  Use  of  the  SWEEP  Tool 
The  following  assumptions  should  be  considered  when  using  the  tool: 

•  All  detected  defects  should  be  recorded  when  they  are  detected. 

•  It  is  assumed  that  defects  are  fixed  when  they  are  discovered  and  that  no  new  defects  are 
injected  during  the  repair  effort. 

•  Defects  should  be  tracked  consistently.  When  using  Mode  2,  if  you  track  a  certain  type  of 
defect  in  the  design  phase,  then  you  should  track  the  same  type  of  defect  throughout  the  life 
cycle  (i.e.,  if  you  choose  to  track  typing  errors,  then  every  phase  should  track  typing  errors). 

•  Defects  in  software  documentation  should  not  be  tracked  with  software  defects. 
Documentation  defects  might  inflate  the  latent  defect  content  of  the  software  product. 

•  The  data  input  into  SWEEP  should  be  validated  and  updated  on  a  regular  basis. 
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2.1.1 2  Mode  1 — Time- Based  Defect  Predictions 

Mode  1  in  the  SWEEP  tool  allows  you  to  estimate  the  defect  discovery  profile  for  your  software 
system  using  time-based  data.  The  time-based  model  is  most  effective  for  projects  that  rely  heavily 
on  testing  to  detect  software  defects  and  that  lack  static  verification  data  obtained  through  inspections, 
peer  reviews,  etc.  Mode  1  can  help  you  both  to  develop  schedule  estimates  to  meet  your  quality  goals 
and  determine  whether  you  will  complete  testing  within  a  given  schedule.  You  can  use  Mode  1  to  help 
you  to  answer  questions  such  as: 

•  What  are  the  estimated  remaining  defects  after  N  more  test  intervals? 

•  How  many  more  test  intervals  do  we  need  to  remove  X  percentage  of  projected  defects? 

•  When  will  we  finish  testing  the  system? 

•  When  can  we  ship  our  software  product? 

You  collect  time-based  data  at  regular  intervals  during  the  integration  test  and  system  test  phases  of 
the  development  process.  The  collected  data  represents  the  number  of  defects  per  code  unit  you  found 
during  a  test  interval.  You  enter  the  collected  data  into  SWEEP  to  predict  the  number  of  latent  defects 
for  future  test  intervals.  After  each  successive  test  interval,  you  enter  actual  defect  data  recorded 
during  that  interval  and  use  SWEEP  to  refine  the  estimates. 

SWEEP  uses  a  Rayleigh  distribution  (Figure  2-1)  to  estimate  the  rate  of  discovery  of  remaining 
defects  in  a  software  system.  The  ideal  graph  of  your  test  data  would  be  a  curve  representing  a  Rayleigh 
distribution.  You  can  use  Mode  1  to  produce  a  graph  of  your  estimated  defects  that  will  have  the  same 
basic  shape  as  the  Rayleigh  distribution.  In  addition,  you  can  use  Mode  1  to  plot  simultaneously  the 
actual  and  estimated  number  of  defects.  These  graphs  offer  an  easy  way  to  compare  the  estimates  of 
your  defect  projections  with  your  actual  data.  You  can  use  the  comparison  as  a  management  indicator 
to  determine  if  you  need  more  testing  intervals,  more  test  personnel,  or  if  you  need  to  make 
adjustments  to  your  development  schedule.  For  example,  if  the  actual  defect  rates  are  lower  than  the 
estimates,  project  management  may  decide  that  adding  additional  test  personnel  will  achieve  the  goals 
more  quickly.  On  the  other  hand,  if  the  actual  defect  rates  are  higher  than  the  estimates,  you  may  need 
to  review  your  testing  process.  It  is  possible  that  you  are  injecting  new  defects  into  the  system  while 
the  identified  defects  are  being  repaired. 

You  can  also  use  Mode  1  to  generate  a  cumulative  curve  that  indicates  the  total  number  of  defects 
found  in  test  intervals.  Figure  2-2  is  an  example  of  a  cumulative  curve  generated  by  Mode  1.  Experi¬ 
ence  has  shown  that  when  a  software  component  is  stable,  the  rate  of  defect  discovery  decreases  with 
time  (Davis  and  Gaffney  1988).  The  cumulative  curve  shows  the  point  at  which  your  rate  of  defect  dis¬ 
covery  is  near  zero,  thus  indicating  that  your  software  has  reached  a  stable  state.  This  type  of  curve 
provides  a  very  valuable  representation  of  when  a  project  is  nearing  the  end  of  testing. 

2.1.13  Mode  2 — Phase-Based  Defect  Predictions 

Mode  2  of  the  SWEEP  tool  allows  you  to  use  a  phase-based  model  to  predict  the  number  of  defects 
you  will  find  in  each  phase  of  the  development  life  cycle.  Mode  2  is  most  effective  for  organizations 
that  perform  static  verification  (e.g.,  inspections,  reviews)  during  the  entire  life  cycle.  Mode  2  allows 
you  to  predict  the  number  of  defects  that  will  be  present  in  the  software  system  before  it  enters  the 
implementation,  testing,  and  integration  phases.  Mode  2  has  the  following  advantages  over  Mode  1: 
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Figure  2-1.  Typical  Rayleigh  Distribution  Curve 


Time 

Figure  2-Z  Cumulative  Farm  of  Rayleigh  Distribution 


•  You  can  plan  and  schedule  testing  activities  more  accurately  for  the  latter  phases  of 
development  due  to  earlier  prediction  of  defects. 

•  You  can  use  your  defect  data  to  predict  the  discovery  of  defects  in  latter  phases  of  the 
development  life  cycle. 

•  You  can  predict  defects  before  your  software  is  in  an  executable  state. 

•  Your  organization  can  compare  the  defect  discovery  histories  for  various  software  products. 

SWEEP  provides  six  default  phases  of  a  development  process:  preliminary  design,  detailed  design, 
coding,  unit  test,  integration  test,  and  system  test  You  can  customize  the  names  of  these  phases,  and 
you  can  add  up  to  three  additional  phases  to  represent  more  accurately  your  development  process. 
During  the  preliminary  and  detailed  design  phases,  you  collect  defect  data  from  static  verification  acti¬ 
vities.  Estimate  an  expansion  ratio  from  the  size  of  the  design  (such  as  number  of  data  flow  diagram 
bubbles  or  number  of  design  pseudo-code  statements)  to  the  size  of  the  code,  based  on  the  experience 
of  your  organization.  For  example,  you  may  decide  that  a  bubble  in  your  design  representation  maps 
to  about 500 source  lines  of  code.  Use  the  expansion  ratio  to  calculate  defect  densities,  and  enter  those 
numbers  into  SWEEP  to  produce  an  estimate  of  the  defect  latency  for  latter  phases.  As  you  progress 
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toward  the  latter  development  phases,  input  the  new  defect  data  to  increase  the  accuracy  of  your  pro¬ 
jections. 

The  accuracy  of  defect  density  projection  in  Mode  2  is  a  function  of  the  quality  of  the  defect  counts 
that  you  provide  and  a  two-parameter  Rayleigh  distribution  (see  Appendix  for  details).  The  phase- 
based  model  requires  defect  counts  from  at  least  two  different  phases  to  develop  a  prediction.  The 
phases  need  not  be  contiguous.  The  accuracy  of  the  SWEEP  estimation  algorithm  will  improve  pro¬ 
portionally  as  you  enter  actual  defect  counts  from  the  other  phases  of  your  development  process.  The 
accuracy  in  Mode  2  will  also  be  greater  if  you  use  the  average  of  the  densities  found  in  several 
inspections,  rather  than  just  in  one  inspection. 

Mode  2  covers  the  basic  phases  of  the  software  development  life  cycle  where  verification  activities  are 
common  practice.  Use  of  Mode  2  with  defect  data  obtained  from  the  requirements  activities  of  the 
software  development  process  requires  special  consideration.  As  verification  methods  for  the  re¬ 
quirements  phase  mature,  it  may  be  feasible  in  future  releases  of  SWEEP  to  add  phases  for  require¬ 
ments  analysis  and  other  systems  engineering-related  activities.  For  the  present,  defects  detected 
during  requirements  analysis  should  be  considered  preliminary  design  defects. 

2.1AA  Mode  3— SWEEP  Planning  Aid 

Mode  3  of  the  SWEEP  tool  is  a  planning  aid  to  help  you  develop  estimated  defect  discovery  and 
tolerance  profiles.  Mode  3  is  most  effective  with,  and  designed  for,  organizations  that  maintain  a 
history  of  latent  defect  counts  over  entire  life  cycles.  You  can  use  this  historical  data  to  estimate  the 
total  number  of  defects  for  each  of  the  phases  during  a  life  cycle  and  the  peak  location  of  the 
distribution. 

The  planning  aid  allows  you  to  set  objectives  for  defect  discovery  ranges,  based  on  user-selected 
criteria.  You  can  enter  data  representing: 

•  Ibtal  number  of  defects  injected  per  code  unit  and  an  estimate  of  your  desired  latent  defect 
content 

•  Ibtal  number  of  defects  injected  per  code  unit  and  the  estimated  peak  of  the  Rayleigh 
distribution 

•  Defect  tolerances,  based  on  either  a  percentage  of  the  defects  per  phase  or  a  signal-to-noise 
ratio 

SWEEP  computes: 

•  Distribution  of  defects  per  phase  for  each  of  the  phases  that  you  have  defined 

•  Upper  and  lower  defect  tolerances,  either  in  terms  of  upper  and  lower  bounds  or  a 
signal-to-noise  ratio 

•  Either  the  peak  location  of  errors  or  the  latent  defect  content 


Using  Mode  3,  you  can  establish  a  desired  goal  for  the  defect  discovery  profile,  and  upper  and  lower 
tolerance  profiles  related  to  the  projections.  You  define  an  acceptable  defect  discovery  range  by  using 
the  upper  and  lower  tolerances  profiles.  This  range,  called  the  defect  tolerance  range,  relates  to  the 
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control  limits  of  your  software  quality  control  process.  You  can  use  the  defect  tolerance  range  to 
identify  and  anticipate  the  point(s)  at  which  a  software  project  deviates  from  this  quality  range.  This 
will  enable  you  to  take  earlier,  more  cost-effective  actions  to  rectify  problems  in  the  development 
process.  Both  the  establishment  of  defect  discovery  goals  and  the  monitoring  of  progress  against  those 
goals  are  fundamental  aspects  of  quantitative  software  management,  equal  in  importance  to 
establishing  and  monitoring  cost,  schedule,  and  size  objectives. 

You  can  establish  goals  for  the  total  defect  injection  and  latent  defect  content  values  in  a  number  of 
ways.  One  way  is  to  base  these  values  on  previous  experience  and  take  the  values  observed  by  earlier 
projects.  You  might  determine  that  you  have  a  better  development  process  than  previous  projects  and 
are  willing  to  establish  more  ambitious  quality  goals.  Your  current  development  process  may  include 
the  use  of  a  more  effective  design  methodology  and  computer-aided  software  engineering  (CASE) 
tools.  This  new  process  could  reduce  the  number  of  defects  introduced  in  the  system  and  might  allow 
you  to  choose  a  smaller  injected  defect  content.  Your  process  may  also  include  improved  verification 
methods  that  would  allow  the  distribution  peak  to  occur  earlier  in  the  development  process. 

Alternatively,  you  might  establish  the  parameters  of  the  goal  defect  discovery  profile  based  on  the 
reliability  requirements  that  the  product  must  satisfy.  For  example,  a  software  product  might  be 
required  to  have  fewer  than  a  specified  number  of  latent  defects  for  it  to  meet  a  reliability  objective. 
The  tolerance  band  about  the  goal  defect  discovery  profile  could  correspond  either  to  its  ability  to 
satisfy  your  customer’s  reliability  requirement  or  the  degree  to  which  you  believe  that  your  process 
can  control  the  level  of  defects  introduced  into  your  software.  Another  alternative  for  the  selection 
of  the  defect  discovery  goal  profile  is  a  combination  of  customer  requirements  and  general  process 
improvement  objectives  (including  verification)  that  have  been  established  for  your  organization. 

2.1.2  Benefits  of  Using  SWEEP 

SWEEP  provides  support  in  two  key  process  improvements  activities,  in  addition  to  the  benefit  of 
improving  your  management  of  defects  within  a  software  product.  First,  SWEEP  can  provide 
improved  management  insight  into  both  your  development  process  and  how  well  you  are  applying  that 
process  to  developing  products.  Management  personnel  can  have  better  access  to  more  information 
earlier  in  the  development  process  and  be  able  to  make  better  management  decisions.  Second, 
SWEEP  generates  valuable  data  for  project  planning  and  scheduling,  thus  improving  your 
organization's  ability  to  perform  and  meet  milestones. 

2.1.2.1  Program  or  Project  Office 

SWEEP  has  the  potential  to  provide  many  benefits  to  organizations,  such  as  program  or  project  offices 
that  are  responsible  for  the  development  of  software  products  and  subsystems.  The  major  capabilities 
of  SWEEP  that  provide  direct  benefits  to  its  users  include  the  following: 

•  The  time-based  model  (Mode  1)  benefits  project  managers,  quality  managers,  and  test 
personnel  by  enabling  them  to  predict  more  accurately  how  many  test  cycles  will  be  needed 
to  meet  quality  standards.  This  model  also  provides  valuable  project  status  information  to 
project  managers,  thus  enabling  them  to  affect  changes  (i.e.,  adding  new  resources,  revising 
schedules).  This  model  can  also  be  used  to  predict  outages  for  complete  systems. 

•  The  phase-based  model  (Mode  2)  benefits  project  members  and  related  personnel  by 
enabling  them  to  predict  the  level  of  effort  needed  in  each  life-cycle  phase  to  meet  established 
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quality  goals.  This  model  extends  the  time-based  model  to  include  as  many  as  nine  phases.  The 
phase-based  model  allows  project  managers  to  leverage  their  corporate  history  by  using  the 
average  defect  discovery  rates  from  other  projects  as  a  starting  point  for  their  estimates. 

•  The  planning  aid  (Mode  3)  benefits  project  managers  by  helping  them  to  establish  defect 
discovery  goals  and  defect-related  quality  goals.  This  aid  allows  users  to  leverage  their 
historical  data  to  produce  accurate  estimates  of  the  latent  defect  content  for  various  phases 
of  the  life  cycle. 

•  Cumulative  curves  benefit  the  user  by  providing  clear  graphical  representation  of  the  number 
of  defects  discovered  and  a  projection  of  the  remaining  defects.  The  cumulative  curve  is  useful 
in  determining  when  you  have  met  your  quality  goals  and  helps  you  project  how  long  it  will  take 
to  reach  those  goals. 

•  Incidence  curves  benefit  the  user  by  providing  side-by-side  comparisons  of  actual  and 
projected  defect  data.  Incidence  curves  also  provide  a  means  to  track  your  actual  data  against 
your  projected  data.  This  type  of  comparison  provides  an  indication  of  the  percentage  of 
estimated  defects  removed  from  the  system. 

2-1^2-2  Centralized  Engineering  Organization 

Centralized  engineering  or  process  metrics  groups  have  a  very  different  charter  from  that  of  project 
office  and  development  teams.  Still,  these  types  of  organizations  can  benefit  from  SWEEP.  SWEEP 
provides  a  capability  for  establishing  quality  metrics,  in  terms  of  defect  discovery  rates,  and  for  track¬ 
ing  progress  against  those  goals.  Organizations  can  use  SWEEP  to  monitor  progress  against  corporate 
goals  and  to  increase  management  insight  into  development  processes.  The  data  can  be  used  as  part 
of  a  corporate  or  divisional  process  improvement  plan.  In  addition,  SWEEP  can  be  beneficial  to 
organizations  seeking  to  improve  the  maturity  of  their  software  engineering  capability. 

Your  organization  can  use  the  SWEEP  tool  to  help  you  meet  some  of  the  goals  of  the  key  practices 
described  in  SETs  CMM.  The  CMM  provides  a  model  for  determining  the  level  of  maturity  of  soft¬ 
ware  engineering  practices  within  organizations.  The  model  defines  five  level,  as  shown  in  Table  2-1. 


Tbble  2-1.  Capability  Maturity  Model 


Level 

Description  of  Organization 

Level  1:  Initial  Level 

Organizations  lack  effective  project  management  and  do  not  maintain  a 
solid,  stable  environment  for  developing  and  maintaining  software  systems. 

Level  2:  Repeatable 
Level 

Organizations  maintain  policies  and  procedures  for  managing  and 
developing  software  systems.  Project  planning  is  based  upon  experience  with 
previously  developed  systems. 

Level  3:  Defined 
Level 

Organizations  have  developed  and  documented  a  standard  process  for 
managing  and  developing  software  systems.  This  process  embodies  the 
policies  and  procedures  developed  at  Level  2. 
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Table  2-1,  continued 


Level 

Description  of  Organization 

Level  4:  Managed 
Level 

Organizations  set  quantitative  goals  for  both  software  products  and 
development  processes.  Software  processes  use  measurement  instruments 
to  collect  important  process  and  product  metrics. 

Level  5:  Optimizing 
Level 

Organizations  are  focused  on  continuous  process  improvement.  Through  the 
use  of  documented  software  processes  and  collected  metrics,  organizations 
can  determine  their  weaknesses  and  develop  strategies  to  correct  them. 

Each  CMM  maturity  level  prescribes  a  number  of  key  practices  that  your  organization  must  be  able 
to  demonstrate  in  order  to  be  certified  at  that  level.  You  can  use  SWEEP  to  develop  and  mature  some 
of  the  capabilities  you  will  need  to  meet  the  goals  outlined  in  these  key  practices.  The  following  sec¬ 
tions  briefly  describe  how  you  can  use  SWEEP  within  your  organization  to  meet  several  of  the  goals 
of  these  key  practices.  Refer  to  Paulk  et  al.  (1993)  for  a  complete  description  of  the  CMM  and  its  goals 
and  activities  for  each  level. 

•  Level  2:  Repeatable:  One  goal  of  the  CMM’s  software  project  planning  practice  is  to  document 
software  estimates  for  use  in  planning  and  tracking  a  software  project.  Using  Modi;  3,  project 
teams  can  develop  an  error  discovery  profile  that  indicates  the  number  ol  defects  for  each 
phase  of  development.  Project  managers  can  use  this  data  to  help  plan  and  schedule 
development  and  test  resources. 

Two  goals  of  the  software  project  tracking  and  oversight  practice  are: 

-  Actual  results  and  performances  are  tracked  against  software  plans. 

-  Corrective  actions  are  taken  and  managed  to  closure  when  actual  results  and 
performance  deviate  significantly  from  the  software  plans. 

Project  teams  can  use  either  Mode  1  or  Mode  2  to  develop  error  discovery  profiles  that 
indicate  the  number  of  defects  discovered  to  date  and  that  estimate  the  number  of  defects  at 
latter  stages  of  development.  Using  the  error  discovery  profile,  project  managers  can  take 
corrective  action  by  adjusting  project  plans  and  schedules. 

One  goal  for  the  software  quality  assurance  practice  is  to  verify  objectively  that  software 
products  and  activities  adhere  to  the  applicable  standards,  procedures,  and  requirements. 
Project  teams  can  use  either  Mode  1  or  Mode  2  to  track  the  project’s  progress  against  the 
quality  goals  of  the  project.  Using  both  the  incidence  and  cumulative  curves,  project  managers 
can  verify  that  their  software  products  meet  the  defect  removal  goals  set  for  the  project. 

•  Level  3:  Defined:  One  goal  of  the  software  product  engineering  practice  is  to  ensure  that 
software  work  products  are  kept  consistent  with  each  other.  SWEEP  provides  a  consistent 
means  to  estimate  defect  discovery  profiles  for  software  products.  The  profiles  developed  for 
each  software  unit  will  help  to  ensure  that  each  unit  meets  the  quality  goals  for  the  project. 
SWEEP  can  also  be  used  in  the  measurement  and  analysis  activities  for  this  practice.  These 
activities  include  tracking  the  number  and  severity  of  defects,  both  cumulatively  and  by  stage. 

•  Level  4:  Managed:  One  goal  of  the  quantitative  process  management  practice  is  to  establish 
organization-wide  quality  goals  and  procedures.  Project  teams  can  use  Mode  2  or  Mode  3  to 
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utilize  the  defect  discovery  goals  established  by  their  organization.  Projects  can  use  the 
profiles  generated  by  SWEEP  (using  the  organization's  quality  goals)  during  project  planning 
activities  to  estimate  schedules  and  resource  needs.  SWEEP  can  help  projects  to  estimate  the 
amount  of  time  and  resources  that  testing  will  need  in  each  phase.  Also,  SWEEP  can  help 
projects  estimate  if  the  current  testing  and  development  schedules  will  enable  them  to  meet 
their  quality  goals.  If  it  seems  likely  that  current  schedules  will  not  enable  a  project  to  meet 
the  organization’s  quality  goals,  project  teams  can  take  steps  to  correct  their  project  plans. 

Attaining  the  objectives  of  the  process  management  practice  includes  the  various  operations 
to  which  SWEEP  can  contribute,  such  as  establishing  goals  and  measuring  their  degree  of  real¬ 
ization,  and  establishing  tolerances  on  quality  objectives  and  ensuring  that  they  are  not 
breached. 

One  ot  the  goals  of  the  software  quality  management  practice  is  to  quantify  and  manage  the 
actual  progress  towards  achieving  the  quality  goals  for  the  software  product.  Once  an  orga¬ 
nization  has  established  quality  goals,  project  teams  can  use  SWEEP  to  monitor  their  progress 
in  removing  defects  to  meet  those  goals.  SWEEP  provides  both  incidence  and  cumulative 
curves  to  help  projects  track  their  actual  progress. 

•  Level  5:  Optimizing:  One  of  the  goals  of  the  defect  prevention  practice  is  to  plan  defect 
prevention  activities.  At  this  level,  defect  prevention  activities,  in  addition  to  the  normal 
testing  and  removal  activities,  are  included  in  the  software  development  plan.  Project  teams 
can  use  SWEEP  to  predict  the  discovery  of  defects  at  various  stages  of  the  development 
process.  These  predictions  are  used  to  develop  strategies  to  prevent  the  injection  of  defects. 

2.2  APPLYING  SWEEP  TO  SOFTWARE  DEVELOPMENT  PROCESSES 

SWEEP  is  applicable  to  any  development  process  used  by  the  project.  Mode  1  is  typically  used  during 
system  test  and  integration  activities,  while  Mode  2  is  used  throughout  the  life  cycle.  The  following 
sections  describe  how  you  might  use  SWEEP  on  a  project  following  a  classic  waterfall  process  and  on 
a  project  using  an  Evolutionary  Spiral  Process  (ESP). 

2.2.1  Classic  Waterfall  Process 

The  waterfall  process  model  is  an  extension  of  a  linear  sequence  of  tasks.  The  standard  waterfall 
permits  a  project  to  back  track  through  the  various  phases  until  it  returns  to  the  phase  in  which  a  defect 
was  introduced  (Figure  2-3).  The  historical  definition  of  the  waterfall  as  a  linear  process  restricts  this 
backup  to  corrective  actions  only.  The  model  does  not  allow  iteration  for  any  other  reason. 

You  can  apply  Mode  1  to  the  testing  and  integration  phases  of  the  waterfall  model.  During  the 
implementation  phase,  unit  testing  has  identified  and  eliminated  many  of  the  defects  in  the  system. 
You  can  then  apply  Mode  1  to  project  the  number  of  defects  remaining  in  the  system,  and  you  can 
estimate  the  number  of  test  intervals  it  will  take  to  reach  the  quality  goals  for  the  system. 

You  can  apply  Mode  2  after  you  have  completed  the  preliminary  and  detailed  design  activities  in  the 
design  phase.  During  these  activities,  you  record  the  number  of  defects  found  during  inspection  and 
verification  tasks.  SWEEP  calculates  the  estimated  number  of  defects  remaining  in  the  system  for 
each  of  the  successive  phases.  As  you  progress  through  the  phases,  you  replace  SWEEP’S  projected 
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defect  counts  with  your  own  actual  counts  for  each  phase.  As  you  enter  the  actual  defect  counts,  you 
increase  the  accuracy  of  SWEEP’S  projections  for  fiiture  phases. 


Project  managers  can  use  Mode  3  to  establish  quality  goals  during  the  initial  planning  activities  of  the 
project.  As  the  project  progresses,  management  can  use  the  planning  aid  during  replanning  activities 
to  reestablish  or  reaffirm  quality  goals. 

2.23,  Evolutionary  Spiral  Process 

The  ESP  model  (Software  Productivity  Consortium  1993)  consists  of  five  main-steps.  A  cycle  traverses 
all  five  steps,  represented  by  a  360°  rotation  in  the  graph,  denoting  that  some  aspect  of  the  product 
has  matured  by  a  specified  amount  (see  Figure  24).  A  cycle  starts  in  the  upper  left  of  the  spiral  with 
the  step  to  determine  objectives,  alternatives,  and  constraints,  and  then  moves  clockwise  to  risk 
analysis  and  aversion,  product  development,  and  planning  and  management.  The  transition  from  the 
fifth  step  back  to  the  first  step  of  the  following  cycle  is  controlled  by  committing  resources  explicitly 
to  continue  the  project. 

The  conceptual  ESP  model  is  essentially  a  management  model.  The  model  describes  five  main  steps 
and  several  specific  product  and  process  management  activities  that  can  be  used  in  conjunction  with 
any  life-cycle  model.  In  addition,  you  can  follow  the  ESP  model  activities  to  determine  a  life-cycle  al¬ 
ternative  that  will  adequately  address  your  objectives  and  constraints,  incorporate  the  life  cycle  into 
the  model,  and  subsequently  evolve  the  life  cycle  as  product  development  proceeds  and  objectives  and 
constraints  change. 

The  ESP  model  is  a  repeating  process  that  uses  the  knowledge  and  experience  gained  from  the 
previous  cycles  to  guide  and  improve  future  cycles  that  is,  you  traverse  all  five  steps  of  the  ESP  model 
one  cycle  at  a  time.  As  noted  above  a  cycle  is  a  complete  traversal  of  all  five  steps  that  completely 
mature  the  product  by  the  amount  defined  in  cycle-specific  objectives  and  success  criteria.  A  spiral 
is  one  or  more  cycles  that  combine  to  accomplish  a  specific  objective,  such  as  complete  a  project, 
product  work  product,  or  other  major  milestone.  A  spiral  may  represent  the  complete  life  cycle  or 
may  include  only  those  activities  necessary  to  meet  one  or  more  of  the  life-cycle  states. 

Organizations  can  apply  the  Mode  1  to  projects  during  the  Develop  Product  step  of  the  ESP  model. 
You  apply  Mode  1  during  the  verification  activities  related  to  system  testing  and  integration  of  this 
step.  If  the  spiral  for  your  project  includes  multiple  cycles  involving  systems  test  and  integration 
activities,  you  apply  Mode  1  during  the  Develop  Product  step  of  each  cycle.  For  example,  if  your  spiral 
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Figure  2-4.  Conceptual  Evolutionary  Spiral  Process  Model:  A  Management  Focus 

encompasses  developing  an  alpha-test  prototype,  a  beta-test  prototype,  and  a  final  product  for  a 
software  system,  you  apply  Mode  1  during  the  Develop  Product  step  for  all  three  of  these  system 
releases. 

Mode  2  could  also  be  applied  during  the  Develop  Product  step.  The  Develop  and  Verify  Product 
activities  within  this  step  include  efforts  that  relate  to  requirements  definition,  design  creation,  and 
product  implementation.  You  collect  and  record  defect  data  during  these  efforts  and  use  the  resulting 
SWEEP  projections  to  influence  your  management  decisions.  As  you  progress  through  the  cycles  that 
define  your  spiral,  you  update  your  previous  data  values  with  your  current  defect  counts  or  estimates. 
With  each  new  update,  you  use  SWEEP  to  produce  more  refined  estimates  of  the  latent  defect 
content. 

You  may  also  be  able  to  use  your  SWEEP  projections  from  both  Modes  1  and  2  to  do  the  following: 

•  You  can  apply  the  information  during  the  Risk  Analysis  activities  of  step  2,  Analyze  Risks. 

•  You  can  distinguish  among  multiple  approaches  to  resolving  a  known  risk.  For  example,  if  you 
are  investigating  two  approaches  to  determine  which  is  the  better  solution  to  a  problem,  you 
could  use  SWEEP  to  help  determine  which  approach  is  better  and  which  is  more  problematic. 

•  You  can  improve  and  refine  your  Plan  and  Schedule  activities  in  step  3,  Plan  Development. 
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•  You  can  deveiop  better  insight  into  actual  status  during  the  Review  Progress  activities  in  step 
5,  Manage  and  Plan. 

The  planning  aid  is  useful  during  the  Plan  and  Schedule  activities  in  step  3,  Plan  Development.  Project 
managers  can  use  the  estimates  derived  during  this  step  to  guide  you  when  you  establish  the  quality 
goals  that  you  will  use  during  the  Review  Progress  activities  of  the  Manage  and  Plan  step. 

23  SUMMARY 

SWEEP  is  designed  to  assist  you  in  establishing  and  managing  software  and  system  quality  objectives. 
You  can  use  SWEEP  to  make  error  projection  using  either  time-based  or  development  phase-based 
data.  You  can  also  use  SWEEP  in  statistical  process  control  activities,  establishing  and  monitoring 
goals  and  tolerance  ranges  for  error  discovery  at  each  stage  of  the  development  process. 

SWEEP  provides  you  with  three  modes  of  operation: 

•  Time-based  Model 

You  can  use  the  time-based  model  to  project  the  rate  of  error  discovery  during  operation  of 
the  software  or  system  based  on  data  obtained  in  testing  to  help  you  decide  upon  the 
cost-effectiveness  of  continuing  the  test  process. 

•  Phase-based  Model 

You  can  use  the  phase-based  model  to  predict  the  number  defects  in  later  development  phases 
based  on  error  data  found  during  static  verifications  in  earlier  phases,  before  actual  code 
execution.  It  also  predicts  the  number  of  errors  remaining  in  the  software,  the  latent  error 
content,  when  it  is  shipped. 

•  Planning  aid 

You  can  use  the  planning  aid  to  establish  and  monitor  the  error  discovery  objectives,  on  a 
phase  by  phase  basis  during  the  development  process.  You  can  also  use  planning  aid  in 
statistical  process  control  activities  to  help  you  develop  higher  quality  software. 

You  can  apply  SWEEP  in  many  organization  functions  such  as  project  or  program  offices  and 
centralized  engineering  organizations  that  are  interested  in  software  quality  assessment  and 
measurement,  and  software  process  improvement.  In  addition,  you  can  apply  SWEEP  in  many 
software  development  processes,  ranging  from  the  classic  waterfall  process  to  the  Evolutionary  Spiral 
Process. 
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SWEEP  is  provided  as  a  set  of  Microsoft  Excel  files  on  a  diskette.  The  distribution  diskette  contains 
files  with  approximately  1  megabyte  of  disk  space.  SWEEP  was  developed  using  Microsoft  Windows 

3.1  and  Microsoft  Excel  4.0.  SWEEP  operates  on  an  IBM-compatible  PC  or  a  Macintosh.  Version  4.0 
(or  later)  of  Microsoft  Excel  must  be  installed  on  your  computer  before  installing  SWEEP. 

The  installation  and  operation  instructions  for  SWEEP  assume  that  you  have  a  working  knowledge 
of  the  computer  system  you  will  be  using.  For  the  PC  implementation,  you  should  be  familiar  with 
Microsoft  DOS  and  Windows.  For  the  Macintosh  implementation,  you  should  be  familiar  with  using 
the  Finder  and  other  basic  Macintosh  tools.  You  do  not  need  to  be  familiar  with  Microsoft  Excel  to 
use  the  SWEEP  tool,  but  you  may  find  some  experience  with  Excel  useful  when  using  SWEEP. 

3.1  SWEEP  UPGRADE 

Enhancements  for  SWEEP  2.0  upgrade  SWEEP  from  Excel  3.0  to  Excel  4.0. 

The  following  characterize  the  enhancements  made  to  the  time-based  model.  Mode  1: 

•  The  ability  to  partition  input  data  into  groups  has  been  added. 

•  Improvements  to  the  basic  Rayleigh  curve  fitting  algorithm  have  been  implemented. 

•  The  graphical  presentation  of  the  resultant  cumulative  and  incidence  curves  have  been 
improved. 

The  following  characterize  the  enhancements  made  to  the  phase-based  model.  Mode  2,  and  the 
planning  aid.  Mode  3: 

•  The  ability  to  create,  change,  delete,  show,  and  select  activity  sets  has  been  added. 

•  Modes  2  and  3  have  been  extended  to  handle  between  six  and  nine  phases. 

•  The  graphical  presentation  to  handle  a  variable  number  of  phases  (between  six  and  nine)  has 
been  improved. 

3.2  USING  SWEEP  ON  THE  PC 

3.2.1  Recommended  Minimum  Configuration 

The  recommended  minimum  configuration  for  running  the  PC  version  of  SWEEP  is: 
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•  80386  processor  (tested  on  an  80486  processor) 

•  VGA  monitor  (tested  on  Super  VGA) 

•  4  megabytes  of  random  access  memory  (RAM) 

•  MS-DOS,  version  5.0  (tested  on  version  5.0) 

•  Microsoft  Windows,  version  3.1  or  later  (tested  on  version  3.1) 

•  Microsoft  Excel,  version  4.0  or  later  (tested  on  version  4.0) 

•  Enough  space  on  the  hard  disk  for  the  SWEEP  files  (1  megabyte) 

3.2.2  Upgrading  SWEEP  1.0 

The  data  files  created  under  SWEEP  1.0  are  not  compatible  with  SWEEP  2.0.  The  data  should  be 
reentered  into  SWEEP  2.0. 

3.23  Installing  SWEEP  rn  a  Hard  Disk 

lb  install  the  SWEEP  software  on  a  hard  disk,  perform  the  following  steps: 

•  Start  Windows. 

•  Create  a  subdirectory  named  sweep  under  the  excel  directory  on  the  hard  disk  (i.e., 
C:\excel\sweep). 

•  Insert  the  disk  containing  SWEEP  into  the  disk  drive. 

•  Copy  all  files  from  the  diskette  to  this  subdirectory. 

•  Open  the  Microsoft  Excel  program  group. 

•  Select  File  from  the  menu  bar  and  then  select  New  from  the  choice  of  menu  options.  The  New 
Program  Object  property  sheet  appears. 

•  Clide  the  Program  Item  line  and  click  [OK].  The  Program  Item  Properties  dialog  box  appears. 

•  In  the  Description  field,  type  SWEEP,  then  press  the  <TAB>  key  or  move  the  mouse  to  the 
Command  line  field. 

•  In  the  Command  line  field,  type  C:\excel\sweep\GO_SWEEP.XLM,  then  press  the  <TAB> 
key  or  move  the  mouse  to  the  Working  Directory  field. 

•  In  the  Working  Directory  field,  type  C:\excel\sweep,  then  press  <RETURN>  or  elide  [OK]. 
Installation  of  SWEEP  on  your  hard  disk  is  now  complete. 

3.2.4  Invoking  SWEEP  on  the  PC 

33.4.1  Invoking  SWEEP  From  a  Diskette 

Tb  invoke  SWEEP  from  a  diskette,  perform  the  following  steps: 
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•  Start  Windows. 

•  Make  sure  the  diskette  is  not  write  protected. 

•  Place  the  diskette  containing  SWEEP  into  the  disk  drive. 

•  Double-dick  on  the  Microsoft  Excel  icon  to  start  Microsoft  Excel. 

•  Select  the  Hie  and  Open  options. 

•  In  the  Directories  list  box,  select  the  drive  (a:  or  b:)  into  which  you  have  inserted  the  diskette 
and  dick  [OK].  The  files  contained  on  the  diskette  are  now  displayed  in  the  Files  list  box. 

•  Select  GO  SWEEEXLM  and  dick  the  [OK]  button.  The  software  loads  and  displays  the 
Software  Productivity  Consortium  logo  with  [Begin]  and  [End]  buttons. 

•  Click  the  [Begin]  button  to  dismiss  the  logo.  The  Hie  and  Modes  pulldown  menus  appear. 
You  are  now  ready  to  use  SWEEP  Its  operation  is  described  in  Section  4. 

X1A2  Invoking  SWEEP  From  a  Hard  Disk 

Tb  invoke  SWEEP  from  a  hard  disk,  perform  the  following  steps: 

•  Start  Windows. 

•  Open  the  Microsoft  Excel  program  group. 

•  Double-click  on  the  SWEEP  icon.  The  software  loads  and  displays  the  Software  Productivity 
Consortium  logo  with  [Begin]  and  [End]  buttons. 

•  Click  the  [Begin]  button  to  dismiss  the  logo.  The  File  and  Modes  pulldown  menus  appear. 
You  are  now  ready  to  use  SWEEP  Its  operation  is  described  in  Section  4. 

33  USING  SWEEP  ON  THE  MACINTOSH 

33.1  Recommended  Minimum  Configuration 

The  recommended  minimum  configuration  for  running  the  Macintosh  version  of  SWEEP  is: 

•  A  Macintosh  capable  of  running  Microsoft  Excel,  version  4.0 

•  4  megabytes  of  RAM  with  default  memory  setting  (tested  on  24  megabyte) 

•  Finder,  version  7.01  or  later  (tested  on  version  7.1.1) 

•  Microsoft  Excel,  version  4.0  or  later  (tested  on  version  4.0) 

•  Enough  space  on  the  hard  disk  for  the  SWEEP  files  (1  megabyte) 
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The  data  files  created  under  SWEEP  1.0  are  not  compatible  with  SWEEP  2.0.  The  data  should  be 
reentered  into  SWEEP  2.0. 

333  Installing  SWEEP  on  a  Hard  Disk 

Tb  install  SWEEP  on  your  hard  disk,  perform  the  following  steps: 

•  Double-click  on  the  hard  disk  icon.  A  window  displaying  the  hard  disk  contents  appears. 

•  Place  the  SWEEP  diskette  in  the  disk  drive. 

•  Double-click  on  the  diskette  ioon.  A  window  displaying  the  diskette  oontents  appears. 

•  Copy  the  SWEEP  folder  by  clicking  once  on  the  SWEEP  icon  and  dragging  it  to  the  hard  disk 
icon.  Release  the  mouse  button  to  start  the  copy  procedure.  The  contents  of  the  folder  are 
placed  in  a  folder  named  SWEEP. 

Installation  of  SWEEP  on  your  hard  disk  is  now  complete. 

33.4  Invoking  SWEEP  on  the  Macintosh 

334.1  Invoking  SWEEP  From  a  Diskette 

Tb  invoke  SWEEP  from  a  diskette,  perform  the  following  steps: 

•  Make  sure  the  diskette  is  not  write  protected. 

•  Place  the  diskette  containing  SWEEP  in  the  disk  drive. 

•  Open  the  diskette  by  double-clicking  on  the  diskette  icon.  A  window  displaying  the  diskette 
contents  appears. 

•  Open  the  SWEEP  folder  by  double-clicking  on  the  folder  icon.  A  window  displaying  the  folder 
contents  appears. 

•  Start  the  SWEEP  application  by  double-clicking  on  the  file  GO_SWEERXLM.  The  software 
loads  and  displays  the  Software  Productivity  Consortium  logo  with  [Begin]  and  [End]  buttons. 

•  Click  the  [Begin]  button  to  dismiss  the  logo.  The  File  and  Modes  pulldown  menus  appear. 
You  are  now  ready  to  use  SWEEP.  Its  operation  is  described  in  Section  4. 

3343  Invoking  SWEEP  From  the  Hard  Disk 

Tb  invoke  SWEEP  from  a  hard  disk,  perform  the  following  steps: 

•  Open  the  hard  disk  by  double-clicking  on  the  hard  disk  icon.  A  window  displaying  the  hard  disk 
contents  appears. 
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•  Open  the  SWEEP  folder  by  double-clicking  on  the  folder  icon.  A  window  displaying  the  folder 
contents  appears. 

•  Start  the  SWEEP  application  by  double-clicking  on  the  file  GO  SWEEP.XLM.  The  software 
loads  and  displays  the  Software  Productivity  Consortium  logo  with  [Begin]  and  [End]  buttons. 

•  Click  the  [Begin]  button  to  dismiss  the  logo.  The  File  and  Modes  pulldown  menus  appear. 
You  are  now  ready  to  use  SWEEP.  Its  operation  is  described  in  Section  4. 

3.4  LIST  OF  FILES  INCLUDED  IN  THE  PC  VERSION  OF  SWEEP 
The  PC  version  of  SWEEP  provides  the  following  files: 

MODE1AJOLM  MODE1A1 XLC 

MODE1AIN.XLS  MODE1A.G.XLS 

MODE2AJCLM  MODE2A1.XLC 

MODE3A.XLM  MODE3A1JCLC 

MENUSjAEXLM  GO.SWEERXLM 

SWEEEXLW  MODE1A.C 

BC30KTL.DLL 

3JS  LIST  OF  FILES  INCLUDED  IN  THE  MACINTOSH  VERSION  OF  SWEEP 
The  Macintosh  version  of  SWEEP  provides  the  following  files: 

MODE1A.XLM  MODE1A1.XLC  MODE1A2.XLC 

MODE1AIN.XLS  MODE1A.GJOLS  MODE1AJD.XLS 

MODE2AXLM  MODE2A1.XLC  MODE2A2.XLC 

MODE3A.XLM  MODE3A1.XLC  ACTIVITY  JCLM 

MENUS^AEXLM  GO.SWEEP.XLM  SPC_LOGO.XLS 

SWEEEXLW  SWEEP 


MODE1A2JOLC 

MODE1AD.XLS 

MODE2A2JCLC 

ACTIVITY.XLM 

SPC XOGO.XLS 

SWEEP.DLL 
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4.  OPERATING  SWEEP 


4.1  USER  INTERFACE  CONTENTIONS 

This  section  provides  a  description  of  the  SWEEP  user  interface  and  a  detailed  example  for  each  of 
the  three  SWEEP  modes.  Section  S  provides  a  case  study  describing  the  operation  of  the  time-based 
model.  Mode  1.  Whenever  possible,  SWEEP  adheres  to  the  user  interface  convention  for  Microsoft 
Excel.  These  conventions  help  to  protect  you  from  accidently  modifying  the  macros  and  software  used 
to  implement  SWEEP.  For  more  information  about  Microsoft  Excel  and  its  interface  conventions, 
please  refer  to  the  Microsoft  Excel  User’s  Guide  (Microsoft  1991,  ix-xii). 

This  section  also  includes  notes  on  specific  aspects  of  using  the  SWEEP  tool.  Notes  indude  warnings 
and  explanations.  Some  notes  are  specific  to  the  PC  environment,  some  are  for  the  Madntosh 
environment,  and  some  notes  are  for  both  systems. 

Notes:  The  graphics  induded  in  this  section  are  from  the  Madntosh  implementation  of  SWEEP.  The 
dialog  boxes  and  windows  for  the  PC  implementation  have  a  similar  appearance. 

You  may  not  have  enough  data  points  for  SWEEP  to  properly  fit  a  Rayleigh  curve  to  your  data 
set  In  these  cases,  your  results  will  not  converge.  You  will  observe  that  the  inddence  curve  will 
continue  to  rise  over  the  data  set  that  you  entered. 

4.1.1  Invoking  SWEEP 

Tb  invoke  SWEEP  for  a  PC  system,  follow  the  instructions  provided  in  Section  3.2.4.  lb  invoke 
SWEEP  for  a  Macintosh  system,  follow  the  instructions  in  Section  3.3.4.  SWEEP  first  presents  you 
with  the  Consortium’s  copyright  notice  (see  Figure  4-1).  This  window  consists  of  two  parts:  a  static  text 
area  and  a  custom  menubar.  The  static  text  area  displays  the  Consortium  logo  and  the  copyright  no¬ 
tice.  The  menu  bar  features  [Begin]  and  [End]  buttons.  Click  on  the  [Begin]  button  to  continue  with 
the  SWEEP  tool.  Click  the  [End]  button  to  quit  the  tool. 

4.1.2  Selecting  From  Menus 

Make  a  menu  selection  by  holding  down  the  mouse  button  and  dragging  the  mouse  over  the  menu 
selections  until  you  highlight  your  desired  choice.  Release  the  mouse  button  on  the  selected  option 
to  execute  your  choice. 

4.13  The  File  Menu 

Figure  4-2  shows  the  menu  hierarchy  for  SWEEP’S  functions.  The  File  menu  presents  the  following 
four  options:  Quit,  Close,  Save,  and  Print  Preview.  Initially,  all  the  File  menu  options  are  disabled  and 
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Software  Error  Estimation  Program  (SWEEP)  Version  02 


Copyright  (o)  1993  Soflwar#  Productivity  Consortium  Sorvioos  Corporation,  Htmdon,  Virgini*.  Permission  to  use,  copy ,  modify , 
ond  distribute  this  motoriol  for  any  purpose  and  without  fee  is  hereby  granted  consistent  with  48  CFR  227  and  252,  and  provided 
that  the  above  copyright  notice  appears  in  all  copies  and  that  both  this  copyright  notice  and  this  permission  notice  appear  tn 
supporting  documentation.  This  material  is  based  in  part  upon  work  sponsored  by  the  Mvanoed  Research  Projects  Agency  under 
Grant  •MDA972-92-J-10IS.  The  oentent  does  not  necessarily  reflect  the  position  or  the  pebey  of  the  U.S.  Government,  and  no 
official  endorsement  should  be  inferred.  The  name  Software  Productivity  Consortium  shall  not  be  used  in  advertising  or  publicity 
pertaining  to  this  material  or  otherwise  without  the  prior  written  permission  of  Software  Productivity  Consortium,  Inc. 
SOFTWARE  PRODUCTIVITY  CONSORTIUM,  MC.  AND  SOFTWARE  PRODUCTIVITY  CONSORTIUM 
SERVICES  CORPORATION  MAKE  NO  REPRESENTATIONS  OR  VARR ANTES  ABOUT  THE  SUITABLITY  OF 
THIS  MATERIAL  FOR  ANY  PURPOSE  OR  ABOUT  ANY  OTHER  MATTER,  AND  THIS  MATERIAL  IS  PROVDED 
WITHOUT  EXPRESS  OR  MPLIED  WARRANTY  OF  ANY  KND. 


Figure  4-1.  Copyright  Notice  for  the  SWEEP  Tool 


Print  Preview 


Figure  4-2.  SWEEP  Menu  Hierarchy 

dimmed  except  for  the  Quit  option.  The  File  menu  options  are  active  only  while  you  are  working  in 
one  of  the  modes.  The  File  menu  options  and  related  caveats  are  described  in  Sections  4.2.8  (Mode  1), 
43.7  (Mode  2),  and  4.4.7  (Mode  3). 

41.4  The  Modes  Menu 

The  Modes  menu  allows  you  to  choose  one  of  SWEEP'S  three  modes  of  operation: 

•  Mode  1— Time-Based  Model 

•  Mode  2 — Phase-Based  Model 

•  Mode  3 — Planning  Aid 

lb  choose  a  SWEEP  mode,  select  one  of  the  modes  listed  from  the  Modes  menu.  After  a  selection 
is  made,  all  three  modes  are  dimmed  until  you  complete  the  selected  mode.  Following  the  selection 
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of  any  mode,  you  are  presented  with  a  series  of  dialog  boxes  r  eq  ues  ting  you  to  enter  da  ta.  Dialog  boxes 
also  are  used  to  display  the  calculated  output  data. 

4.2  MODE  1— TIME-BASED  MODEL  OPERATION 

lb  use  Mode  1,  specify  the  number  of  time  intervals  (n)  and  enter  the  number  of  errors  discovered 
during  each  interval.  Time  intervals  may  be  weeks,  days,  or  other  user-specified  time  period.  SWEEP 
fits  your  data  to  a  two-parameter  curve.  One  parameter  is  E,  the  total  number  of  errors  expected  to 
be  made  throughout  software  development  and  discovered  over  the  total  life  cycle.  The  other  variable 
is  the  (Tp)  the  peak  rate  of  defect  incidence  occurs  (Tp).  To  ensure  that  SWEEP  produces  a  valid  esti¬ 
mate  in  Mode  1  operation,  e.g.,  be  careful  when  entering  error  counts.  The  error  discovery  data 
should  be  for  the  same  unit  of  code  size  (e.g.,  KSLOC,  function  points)  in  all  test  intervals  used.  If  the 
size  of  the  code  changes  after  the  test  period,  normalize  the  error  discovery  data  by  the  size  of  the  code 
at  that  time  (e.g.,  if  you  measure  code  size  by  KSLOC,  use  errors/KSLOC  instead  of  errors).  SWEEP 
then  estimates  E  as  lifetime  errors/KSLOC.  Mode  1  is  typically  used  for  periodic  error  input  and 
tracking;  therefore,  SWEEP  maintains  your  data  from  session  to  session. 

1b  start  the  time-based  model,  select  Mode  1 — Time-Based  Model  from  the  Modes  menu.  The 
Number  of  Intervals  dialog  box  (see  Figure  4-3)  appears. 


4.2.1  Number  of  Intervals  Dialog  Box 

The  dialog  box  presents  one  input  field,  an  [OK]  button,  and  a  [Cancel]  button.  Enter  in  the  input  field 
the  number  of  weeks  or  intervals  over  which  error  data  has  been  recorded.  The  input  value  must  be 
greater  than  or  equal  to  2  and  less  than  or  equal  to  100.  After  you  enter  the  number  of  intervals,  click 
[OK]  or  press  the  <RETURN  >  key.  If  you  click  the  [OK]  button  or  press  the  <RETURN  >  key  with¬ 
out  entering  any  data,  an  error  message  will  be  displayed  indicating  that  data  must  be  entered.  Once 
you  have  entered  the  number  of  intervals,  SWEEP  will  close  the  dialog  box  and  open  a  Data  input  box 
(see  Figure  4-4). 

The  Data  input  dialog  box  allows  you  to  enter  the  error  data  for  each  of  the  completed  test  intervals. 
It  offers  a  [Clear  Data]  button,  a  [Set  Group]  button,  a  [Compute]  button,  a  [Cancel]  button,  and  two 
columns  of  data.  The  left  column  is  a  reference  column  listing  the  number  of  intervals.  The  left  column 
extends  down  to  the  maximum  number  of  intervals  (100).  The  right  column  is  the  number  of  defects 
recorded  for  that  interval.  Input  a  value  for  the  number  of  errors  that  were  discovered  for  a  particular 
time  interval.  If  you  do  not  have  data  for  a  particular  interval,  do  not  enter  a  value  in  the  errors  column. 
You  must  enter  a  value,  however,  for  at  least  two  intervals.  If  you  enter  a  space  or  some  other  non- 
numeric  character,  SWEEP  will  display  an  error  message  after  you  click  the  [Compute]  button.  After 


Get  N 


Enter  the  number  of  internals  (n)  that  you  haue  error  data: 

l[  0*  )l  t  Cancel  j 


Figure  4-3.  Number  of  Intervals  Dialog  Box 
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Warning.  Do  not  enter  a  slash  (e.g.,  1/2)  in  the  Intervals  Dialog  Box.  Excel  will  interpret  it  as  a  date 
and  change  it  into  a  date  cell.  To  correct  this,  type  0 1/2  to  return  the  cell  to  a  numeric 
cell. 


I  M0DE1HIN.KLS  f 

[  Clear  Data  ] 

(  Set  Group  1 

l  Compute  j 

[  Cancel  ] 

interval 

Errors 

1 

1 

2 

0 

3 

1 

4 

15 

5 

15 

6 

32 

7 

29 

8 

45 

9 

34 

10 

87 

11 

41 

12 

71 

13 

77 

14 

80  ! 

Figure  4-4.  Example  of  a  Data  Input  Box 

data  entry  is  completed,  click  the  [Compute]  button  to  compute  the  error  projections.  If  you  click  the 
[Cancel]  button  while  this  dialog  box  is  displayed,  you  will  return  to  the  Main  window. 

You  can  erase  all  previously  entered  data  by  clicking  the  [Clear  Data]  button.  This  erases  all  input  data 
from  the  errors  column  (the  right  column).  SWEEP  will  ask  you  to  confirm  that  you  wish  to  clear  the 
data  before  any  data  is  deleted  (see  Figure  4-5). 


A  Selecting  OK  urill  erase 

ok 

Error  Column.  Rre  you 

Cancel  ] 

sure? 

Figure  4-5.  Gear  Data  Confirmation  Dialog  Box 


Example-.  You  have  error  data  for  13  weeks.  Enter  “13”  into  the  Number  of  Intervals  dialog  box,  and 
click  [OK].  Click  the  [Clear  Data]  button  in  the  Data  input  dialog  box,  click  the  [OK]  button 
in  the  Clear  Data  Confirmation  dialog  box,  and  then  enter  your  error  data,  one  value  per 
cell: 


12134851  10  8964 
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After  your  error  data  has  been  entered,  you  can  click  either  the  [Set  Group]  or  the  [Compute]  buttons. 
If  you  click  the  [Set  Group]  button,  the  Grouping  input  box  appears  (see  Figure  4-6).  This  box  allows 


Set  Group 


Current  number  of  interval  (n): 
Number  of  intervals  per  group: 

1-Data  Trimming  Options - 

O  Trim  data  from  front 
<•>  Trim  data  from  back 


Shoiv  Group 


23 


Cancel 


Figure  4-6.  Grouping  Input  Box 

you  to  group  the  error  data  for  analysis.  The  grouping  function  organizes  consecutive  intervals  into 
groups.  For  example,  if  you  select  two  intervals  per  group,  then  the  values  for  intervals  1  and  2  will 
be  added  together  as  group  1;  intervals  3  and  4  will  be  added  together  for  group  2;  etc.  If  the  error  data 
is  sparse  (many  intervals  have  only  a  few  or  no  errors),  you  will  improve  the  accuracy  of  your  estimates 
by  grouping  interval  error  data  into  larger  groups.  Grouping  can  also  smooth  your  data  and  may  im¬ 
prove  the  accuracy  with  which  SWEEP  plots  your  data.  Input  the  number  of  intervals  to  be  included 
in  each  group.  If  the  number  you  select  for  your  intervals  per  group  does  not  divide  evenly  into  the 
number  of  intervals  in  your  data  set,  then  you  must  decide  whether  to  discard  extra  intervals  from  the 
beginning  of  your  data  or  the  end  of  your  data.  Select  whether  to  trim  data  from  the  front  or  the  back 
of  the  entered  error  data. 

Note:  Set  Group  will  treat  a  grouping  of  null  values  as  0.  Tb  obtain  a  result  that  reflects  null  values, 
reenter  your  grouped  data  in  the  Intervals  Dialog  Box  and  replace  the  0  with  a  null  value. 

Example If  you  have  23  intervals  and  you  select  2  intervals  per  group,  you  will  have  an  extra  interval 
that  cannot  be  put  in  a  group.  You  must  decide  if  you  wish  the  first  interval  or  the  last  to  be 
trimmed.  (The  value  will  be  ignored  only  by  the  grouping  function;  it  will  not  be  deleted  from 
your  data  set.) 

Click  the  [Show  Group]  button  to  display  the  Grouping  display  box,  which  shows  the  number  of  errors 
in  each  group  (see  Figure  4-7).  Several  minutes  may  elapse  before  this  box  is  displayed,  depending 
on  the  number  of  intervals  you  have  entered  and  the  speed  of  your  computer. 

Example-.  You  want  to  combine  successive  error  intervals,  discarding  the  last  interval.  In  the  Grouping 
input  box,  enter  2  into  the  Number  of  Intervals  Per  Group  input  field,  sel  ect  Him  Data  from 
Back,  and  then  click  the  [Show  Group]  button.  The  Grouping  display  box  then  shows  the 
grouped  data  (a  portion  of  the  original  error  data  is  displayed  in  Figure  44). 


4-5 


ling  SWEEP 


f  M0DE1R-G.HLS  1 

[  Compute  J 

f  Cancel  ] 

Grots) 

Errors 

l 

1  V  1 

2 

16  | 

3 

47 

4 

74 

5 

101 

6 

112 

7 

157 

8 

122 

9 

152 

10 

99 

11 

102 

12 

13 

14 

Figure  4-7.  Grouping  Display  Bax 


4.12  Estimated  Errors  Dialog  Box 

When  you  dick  the  [Compute]  button  in  either  the  Data  Input  or  Grouping  display  boxes,  error 
projections  are  computed  and  the  Estimated  Errors  dialog  box  (see  Figure  4-8)  appears. 


Estimated  Errors 


Errors  Discouered  to  Date:  1013 

Total  Errors  Projected:  1 455.04 

Percentage  of  Projected  Errors  Found  to  Date:  69.62 

Estimated  Location  of  Peak:  14.90 

rError  Estimate  Selection - 


©Estimate  Number  of  Errors  in  Neut  (m)  Internals 

O  Determine  Number  of  Internals  Required  to  Rchieue 
the  Following  %  (p),  based  on  the  Total  Number  of 
Errors  Projected 


Enter  Data  (m  or  p):  |iq|  |  ([compute  ])  [  Cancel  ) 


[  Cumulatiue  Curue  )  [  Incidence  Curue  ]  [  Data  Sheet  ) 


Figure  4-8.  Using  (m)  in  the  Estimated  Errors  Dialog  Bax 
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The  dialog  box  displays  the  following  four  outputs  based  on  the  user  inputs: 

•  Errors  Discovered  to  Date.  The  sum  of  the  number  of  errors  you  have  entered. 

•  Total  Errors  Projected.  The  estimated  number  of  errors  injected  in  the  software. 

•  Percentage  of  Projected  Errors  Found  to  Date.  The  percentage  of  total  errors  estimated  to  be 
discovered  through  interval  (n). 

•  Estimated  Location  of  Peak.  T!  -  peak  of  the  Rayleigh  curve  that  SWEEP  fits  to  your  data. 
The  dialog  box  also  features  five  buttons: 

•  Compute.  Either  estimate  the  number  of  errors  in  the  next  intervals  or  determine  the  number 
of  intervals  required  to  remove  the  given  percentage  of  errors. 

•  Cumulative  Curve.  Draw  the  cumulative  curve  for  this  data  set. 

•  Incidence  Curve.  Draw  the  incidence  curve  for  this  data  set 

•  Cancel.  Close  the  Estimated  Errors  dialog  box  and  return  to  the  Data  Input  or  Grouping  dialog 
box. 

•  Data  Sheet.  Display  the  underlying  data  sheet. 

The  two  radio  buttons  in  the  Error  Estimate  Selection  segment  of  the  dialog  box  enable  you  to  select 
one  of  the  two  types  of  estimate  that  SWEEP  calculates.  These  buttons  are  used  in  conjunction  with 
the  [Compute]  button. 

The  first  radio  button  option  can  help  you  determine  if  testing  should  be  performed  for  a  given 
interval,  based  on  the  number  of  errors  expected  to  be  discovered.  This  option  allows  you  to  estimate 
the  number  of  errors  projected  in  the  next  (m)  intervals.  If  you  select  this  option,  you  must  enter  an 
integer  value  greater  than  zero.  SWEEP  computes  the  total  number  of  errors  expected  to  be 
discovered  in  the  next  (m)  intervals  and  provides  an  estimate  of  the  percentage  of  total  errors  that  (m) 
and  the  initial  (n)  intervals  entered  represent. 

Select  the  first  radio  button  option,  enter  a  positive  integer  for  (m),  and  dick  the  [Compute]  button. 
This  opens  the  Calculated  Values  Based  on  (m)  dialog  box  (discussed  in  Section  4.2.3  and  shown  in 
Figure  4-9).  The  dialog  box  displays  outputs  computed  as  a  result  of  your  input  for  (m).  Clicking  the 
[OK]  button  returns  you  to  the  Estimated  Errors  dialog  box. 

Example:  You  want  to  estimate  how  many  errors  will  occur  in  the  next  10  test  intervals.  Select  Estimate 
Number  of  Errors,  then  enter  10  in  the  Enter  Data  input  field,  and  click  the  [Compute] 
button. 

The  second  radio  button  option  can  help  you  determine  how  many  more  test  intervals  are  required 
to  detect  a  given  percentage  of  the  injected  errors.  This  option  allows  you  to  enter  a  percentage  (p), 
where  (p)  is  less  than  100  and  greater  than  the  percentage  of  errors  discovered  to  date.  SWEEP  indi¬ 
cates  both  the  number  of  intervals  required  to  achieve  that  percentage  and  how  many  intervals  beyond 
the  (n)  intervals  are  left  to  achieve  (p). 
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-Estimated  Errors  based  on  Intervals  Entered  (n) - 

Number  of  Intervals  Entered:  23 
Total  Errors  Discovered  to  Date:  1013 
Total  Errors  Projected:  1 455.04 

Intervals  Needed  to  Achieve  99.99%  of  Total  Errors:  63.95 


-Estimated  Total  Errors  based  on  Intervals  through  m  (n+m)— 
Number  of  Intervals  Estimated  (m):  10 
Total  Errors  Discovered  through  Interval  (m):  1329.80 
Estimated  Errors  in  (ml  Intervals:  316.80 
Percentage  of  Total  Errors:  91.39 


Figure  4-9.  Calculated  \felues  Based  cm  (m)  Dialog  Bax 

Select  the  second  radio  button  option,  enter  a  value  for  (p),  and  click  the  [Compute]  button.  This 
opens  the  Calculated  Values  Based  on  (p)  dialog  box  (discussed  in  Section  4.2.4),  which  displays 
outputs  computed  from  your  input  for  (p).  Clicking  the  [OK]  button  returns  you  to  the  Estimated  Errors 
dialog  box. 

4.23  Calculated  Values  Based  on  (m)  Dialog  Box 

The  Calculated  Values  Based  on  (m)  dialog  box,  shown  in  Figure  4-9,  features  eight  output  fields  and 
an  [OK]  button. 

The  output  fields  provide  you  with  the  following  information: 

•  Number  of  Intervals  Entered  (n).  The  number  of  test  intervals  for  which  you  have  data. 

•  Total  Errors  Discovered  to  Date.  The  sum  of  the  error  counts  that  you  have  entered  through 
interval  (n). 

•  Total  Errors  Projected.  The  estimated  total  number  of  errors  injected  into  the  software. 

•  Intervals  Needed  to  Achieve  99 39  Percent  of  Toted  Errors.  The  number  of  test  intervals  estimated 
to  be  required  to  discover  99.99  percent  of  the  total  errors  projected. 

•  Number  of  Intervals  Estimated  (m).  The  number  of  intervals  beyond  the  number  of  intervals 
entered  (n)  for  which  you  would  like  to  estimate  the  number  of  errors  to  be  discovered. 

•  Total  Errors  Discovered  Through  Interval  (m).  The  estimated  number  of  errors  expected  to  be 
discovered  through  (n+m)  intervals. 
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•  Estimated  Errors  in  (m)  Intervals.  The  number  of  errors  estimated  to  be  discovered  during  the 
next  (m)  intervals. 

•  Percentage  of  Total  Errors.  The  percentage  of  total  errors  estimated  to  be  discovered  through 
(n+m)  intervals. 

Clicking  the  [OK]  button  returns  you  to  the  Estimated  Errors  dialog  box. 


424  Calculated  Values  Based  on  (p)  Dialog  Box 

You  can  also  estimate  the  number  of  intervals  required  to  achieve  a  given  percentage  of  defects 
discovered  in  the  system.  In  the  Estimated  Errors  dialog  box,  select  the  second  radio  button  option 
in  the  Error  Estimate  Selection  area  (see  Figure  4-10).  Click  the  [Compute]  button.  The  Calculated 


Figure  4-10.  Using  %  (p)  in  the  Estimated  Errors  Dialog  Bax 

Values  Based  on  (p)  dialog  box,  depicted  in  Figure  4-11,  appears.  This  dialog  boxfeatures  seven  output 
fields  and  an  [OK]  button.  The  output  fields  provide  the  following  information: 

•  Number  of  Intervals  Entered.  The  number  of  test  intervals  for  which  you  have  data. 

•  Total  Errors  Discovered  to  Date.  The  sum  of  the  number  of  errors  for  which  you  have  entered 
data. 

•  Total  Errors  Projected.  The  estimated  total  number  of  errors  to  be  injected  into  the  software. 

•  Percentage  of  Estimated  Errors  Discovered.  The  percentage  of  total  errors  discovered  through 
interval  (n). 


Percentage  (p)  Entered  for  Estimate.  The  value  is  input  to  determine  how  many  more  intervals 
are  required  to  achieve  a  specified  percentage  (p)  of  the  total  errors  projected. 
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Calculate  P 

r*  ii  ’rrv  inn .  mm\ 

-Estimated  Errors  based  on  Internals  Entered  (n) - 

Number  of  Internals  Entered:  23 

Total  Errors  Discovered  to  Date:  1013 

Total  Errors  Projected:  1 455.04 

Percentage  of  Projected  Errors  Discovered:  69.62 

-Estimated  Errors  bosed  on  Percentage  (p) - 
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Intervals  Remaining  After  In)  to  Achieve  (p):  37.01 

L  JL  j 

Figure  4-11.  Calculated  Values  Baaed  on  (p)  Dialog  Box 

•  Intervals  Needed  to  Achieve  (p).  The  estimated  total  number  of  test  intervals  required  to 
achieve  (p). 

•  Intervals  Remaining  After  (n)  to  Achieve  (p).  The  number  of  intervals  beyond  (n)  to  achieve  (p). 
Clicking  the  [OK]  button  returns  you  to  the  Estimated  Errors  dialog  box. 

4.15  Cumulative  Curve  Chart 

From  the  Estimated  Errors  dialog  box  (see  Figure  4-10),  you  can  generate  two  different  graphs  of 
error  data:  the  cumulative  curve  and  incidence  curve  charts.  Click  the  [Cumulative  Curve]  button  to 
display  a  chart  showing  both  the  cumulative  number  of  actual  errors  per  input  interval  and  the 
estimated  cumulative  errors  (see  Figure  4-12).  When  the  chart  is  displayed,  use  the  File  menu  options 
to  dose,  save,  or  print  the  chart  (see  Section  4.2.8). 

4.16  Incidence  Curve  Chart 

In  the  Estimated  Errors  dialog  box,  click  the  [Inddence  Curve]  button  to  display  a  chart  showing  the 
number  of  actual  errors  per  input  interval  and  estimated  errors  (see  Figure  4-13).  When  the  chart  is 
displayed,  use  the  File  menu  options  to  close,  save,  or  print  the  chart  (see  Section  4.2.8). 

4.17  The  Data  Sheet 

The  Data  Sheet  option  on  the  Estimated  Errors  dialog  box  provides  you  access  to  the  data  used  to 
produce  both  the  inddence  and  the  cumulative  curves  (see  Figure  4-14).  You  may  use  the  File  menu 
options  to  dose,  save,  or  print  the  data  sheet  (see  Section  4.2.8).  Die  data  sheet  presents  the  following 
data  values: 
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Errors  Discovered  to  Date.  The  sum  of  the  number  of  errors  you  have  entered. 
Total  Errors  Projected.  The  estimated  number  of  errors  injected  in  the  software. 
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Figure  4-12.  Cumulative  Curve  Chart 

•  Percentage  of  Projected  Errors  Found  to  Date.  The  percentage  of  total  errors  estimated  to  be 
discovered  through  interval  (n). 

•  Estimated  Location  of  Peak.  The  peak  of  the  Rayleigh  curve  that  SWEEP  fits  to  your  data. 

In  addition,  the  SWEEP  data  sheet  lists  the  following  values  for  each  test  interval: 

•  Interval.  Ifest  interval  number. 

•  Actual  Error.  Your  input  data. 

•  Estimated  Error.  The  number  of  errors  estimated  by  SWEEP. 

•  Error  Delta.  The  difference  between  the  Actual  Error  and  Estimated  Error. 

•  Relative  Delta  (percentage).  The  difference  between  the  Error  Delta  and  the  Actual  Error, 
divided  by  the  Actual  Error. 
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SWEEP 


Figure  4-13.  Incidence  Curve  Chart 


•  Cumulative  Percentage  of  E.  The  estimated  percentage  of  Tbtal  Errors  Projected  through  (n) 
intervals. 

•  Actual  Cumulation.  The  sum  of  your  input  data  of  actual  errors. 

•  Estimated  Cumulation.  The  number  of  errors  through  the  interval  indicated,  as  calculated  by 
SWEEP. 

•  Cumulation  Delta.  The  difference  between  the  Actual  Cumulation  and  Estimated  Cumulation. 

•  Relative  Delta  (percentage).  The  difference  between  the  Cumulation  Delta  and  the  Actual 
Cumulation,  divided  by  the  Actual  Cumulation. 


4.2.8  The  File  Menu 

The  File  menu  offers  the  following  options: 

•  Quit.  Exits  both  the  application  and  Microsoft  Excel. 

•  Close .  Coses  the  active  document  and  returns  to  the  previous  screen. 
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Save.  Saves  the  displayed  document  to  a  tile.  A  dialog  box  appears  requesting  both  the  name 
of  the  directory  to  be  saved  to  and  name  of  the  file  to  be  saved.  Change  the  file  name  from  the 
current  file  name,  otherwise  SWEEP  will  overwrite  this  file  the  next  time  it  is  run. 
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Figure  4-14.  Time-Baaed  Model  Output  Data  Sheet 

Note:  If  you  attempt  to  save  a  data  sheet  in  a  directory  other  than  the  working  directory,  a 
macro  error  may  occur.  Just  rename  the  default  file  name  and  dick  the  [OK]  button 
to  save  the  file  properly  in  the  current  directory. 

•  Print  Preview,  lb  print  the  displayed  data  sheet,  select  Print  Preview  from  the  File  menu.  The 
standard  Excel  Print  Preview  window  appears. 

Note:  Attempting  to  print  when  no  printer  is  connected  may  result  in  a  macro  error.  If  this 
happens,  dick  the  [Halt]  button  in  the  Macro  Error  message  box,  select  Quit  from  the 
File  menu,  and  restart  SWEEP. 


4.2L9  The  Modes  Mend 

The  Modes  menu  is  dimmed  until  you  dose  the  displayed  document. 

43  MODE  2— PHASE-BASED  MODEL  OPERATION 

When  you  select  Mode  2 — Phase-Based  Model  from  the  Modes  menu,  the  Mode  2  Input  dialog  box 
(see  Figure  4-15),  which  requests  user  input,  is  displayed. 

43.1  Mode  2  Input  Dialog  Box 

The  Mode  2  Input  dialog  box  allows  you  to  select  and  manipulate  activity  sets  and  to  enter  data  for 
phases  in  an  activity  set.  The  default  defect  activity  set  has  six  phases  (see  Figure  4-15).  You  can  use 
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the  [Create]  button  to  design  an  activity  set  that  more  closely  matches  your  development  process  than 
those  given  in  SWEEP  (see  Section  4.3 3).  You  can  enter  up  to  three  additional  phases  and  can  custom¬ 
ize  the  names  of  all  the  phases  in  an  activity  set.  Alter  the  number  and  the  names  of  the  phases  in  an 
activity  set  tty  clicking  on  the  [Change]  button  (see  Section  43.4).  You  may  also  remove  activity  sets 
with  the  [Delete]  button  (see  Section  4.3.5).  The  Activity  Set  Selection  menu  provides  you  access  to 
any  stored  activity  sets. 


Mode  2  -  Input 
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Figure  4-15.  Mode  2  Input  Dialog  Bax 

You  enter  your  data  in  the  fields  in  the  Data  Input  Area.  Enter  a  minimum  of  two  values,  representing 
error  densities  estimated  during  particular  phases  of  the  life  cycle.  If  you  enter  fewer  than  two  inputs 
and  click  the  [OK]  button,  an  error  message  is  displayed.  The  input  values  must  be  greater  than  zero. 
After  you  enter  your  data,  click  the  [OK]  button.  SWEEP  will  compute  the  best  fit  for  the  data  you 
entered.  It  provides  a  plot  (see  Figure  4-16)  of  the  fit  as  a  function  of  the  location  of  the  peak.  You 
can  alter  the  peak  location  to  override  SWEEP’S  selection  of  the  best  fit.  (If  you  click  the  [Cancel] 
button  while  the  dialog  box  is  displayed,  you  will  return  to  the  Main  window  displaying  the  File  and 
Modes  menu  bar,  which  allows  you  to  exit  SWEEP  or  initiate  any  of  the  three  modes  of  SWEEP.) 

When  the  computation  is  complete,  SWEEP  displays  a  graph  that  indicates  where  the  peak  location 
of  the  distribution  will  occur,  based  on  the  input  data.  In  the  upper  right  corner  of  the  graph,  ihe  values 
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Figure  4-16.  Example  of  a  Line  Graph  Dialog  Bax 

for  D(B)  and  peak  location  (T)  are  displayed.  The  Line  Graph  dialog  box  and  the  corresponding  line 
graph  (shown  in  Figure  4-16)  appear  in  the  center  of  the  screen. 

43.2  Line  Graph  Dialog  Box 

The  line  Graph  dialog  box  offers  four  options: 

•  OK 

•  Print 

•  Save 

•  Cancel 


Clide  the  [OK]  button  assumes  that  you  accept  the  default  values  calculated  by  SWEEP  for  D(B)  and 
T  prompting  SWEEP  to  generate  a  Rayleigh  distribution  curve.  The  value  of  D(B)  directly  correlates 
to  a  value  of  T  (peak  location  of  Rayleigh  distribution  curve).  Section  43.6  describes  this  graph  in 
greater  detail. 

Click  the  [Print]  button  to  print  the  line  graph  to  a  printer  using  the  standard  Excel  Print  Preview 
functions.  Click  the  [Print]  option  to  open  the  Excel  Print  Preview  window.  Refer  to  the  Microsoft  Excel 
User’s  Guide  (Microsoft  1991)  for  details  on  the  Print  Preview  options. 

Click  the  [Save]  button  to  save  the  displayed  graph.  A  dialog  box  appears  requesting  the  file  name  to 
be  saved  and  the  directory  where  the  file  is  to  be  stored.  IMPORTANT:  Change  the  file  name  to  a 
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name  other  than  the  listed  default  name.  If  you  do  not  change  the  file  name,  the  file  is  saved  using  the 
current  file  name.  Be  aware  that  the  next  time  SWEEP  is  run,  it  will  overwrite  the  default  file  name 
and  the  previously  saved  data  will  be  lost  Therefore,  be  sure  to  use  a  unique  file  name  when  you  save 
a  graph. 

Nam  If  you  attempt  to  save  a  chart  in  another  directory,  a  macro  error  may  occur.  Just  rename  the 
default  chart  name  and  click  the  [OK]  button.  This  saves  the  chart  in  the  current  directory 
under  the  new  chart  name. 


4.33  Create  Activity  Set 

Creating  activity  sets  allows  you  to  customize  both  the  number  of  phases  and  the  name  for  each  phase. 
Clide  the  [Create]  button  in  the  Mode  2  Input  dialog  box  (see  Figure  4-15)  to  display  the  Create  Activ¬ 
ity  Set  dialog  box,  shown  in  Figure  4-17.  This  box  provides  you  with  the  input  fields  to  name  the  activity 
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Figure  4-17.  Create  Activity  Set  Dialog  Bax 

set  and  each  of  the  phases  in  the  activity  set.  A  pulldown  menu  allows  you  to  select  the  number  of 
phases.  You  may  designate  from  six  to  nine  phases.  Click  the  [OK]  button  to  verify  the  data  you  have 
entered  and  to  create  the  new  activity  set. 


: 
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4.  Operating  SWEEP 


Example:  You  want  to  create  an  activity  set  with  six  phases.  Click  the  [Create]  button  and  enter  My 
Activity  Set  as  the  activity  set  name.  Select  6  from  the  Number  of  Phases  pulldown  menu 
and  then  enter  Preliminary  Design,  Detailed  Design,  Coding,  Unit  Test,  Integration  Tfest, 
and  System  Test  into  the  first  six  phase-name  fields,  using  the  <TAB>  or  <DOWN 
ARROW>  keys  to  move  the  input  fields.  Click  the  [OK]  button  to  create  this  activity  set. 

43.4  Change  Activity  Set 

lb  change  the  phase  names  or  number  of  phases  in  an  activity  set  you  have  already  created,  dick  the 
[Change]  button  on  the  Input  dialog  box  (see  Figure  4-15).  This  displays  the  Change  Activity  Set  dialog 
box  (see  Figure  4-18).  From  this  box,  you  can  select  the  activity  set  to  be  modified,  change  the  number 
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Figure  4-18.  Change  Activity  Set  Dialog  Boot 


of  phases,  and  modify  the  phase  names.  You  cannot  change  the  activity  set  name  after  you  have  created 
the  activity. 
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Example;  In  the  activity  set  you  created,  you  want  to  change  a  Phase  2  name  to  Design  and 
Prototyping.  Click  the  [Change]  button  on  the  Input  dialog  box.  On  the  Change  Activity  Set 
dialog  box,  select  the  activity  set  you  want  to  modify  (My  Activity  Set)  from  the  Activity  Set 
Name  pulldown  menu.  Clide  the  Phase  2  input  field  and  press  the  Delete  key  on  the  key¬ 
board  to  erase  the  old  phase  name.  Then  enter  Design  and  Prototyping  and  click  the  [OK] 
button. 

43.5  Delete  Activity  Set 

Tb  delete  an  activity  set,  click  the  [Delete]  button  on  the  Input  dialog  box  (see  Figure  4-15).  This 
displays  the  Delete  Activity  Set  dialog  box  (see  Figure  4-19).  Select  the  activity  you  want  from  the 
Activity  Set  Name  pulldown  menu  and  click  the  [OK]  button. 
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Figure  4-19.  Delete  Activity  Set  Dialog  Box 
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43.6  Rayleigh  Distribution  Graph 

An  example  of  a  Rayleigh  distribution  graph  is  shown  in  Figure  4-20. 
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Figure  4-20.  Example  of  a  Rayleigh  Distribution  Graph 
The  graph  provides  the  following  information: 

•  Vertical  axis  (errors) 

•  Horizontal  axis  (phases) 

•  Blue  bars 

The  data  you  input  is  represented  as  the  bar  on  the  left  hand  side  for  each  phase.  If  no  input 
data  exists  for  a  phase,  the  bar  is  not  displayed. 

•  Red  bars 

The  fitted  data  is  represented  as  the  bar  on  the  right  hand  side  for  each  phase.  If  no  input  data 
exists  for  a  phase,  only  this  bar  is  displayed. 

•  Selected  peak  location 

•  Tbtal  errors  injected 
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•  Latent  errors 

•  Error  discovery  efficiency 

After  the  graph  has  been  displayed,  you  can  select  an  option  from  the  File  menu. 

Note:  If  errors  are  peaking  toward  the  end  phases,  the  error  bars  may  conflict  with  the  information 
in  the  top  right  comer  of  the  graph. 

43.7  The  File  Menu 

The  File  menu  offers  the  following  options: 

•  Quit.  Exits  the  application  and  Microsoft  Excel. 

•  Close.  Closes  the  Rayleigh  distribution  graph  and  returns  to  the  Mode  2  Input  dialog  box. 

•  Save.  Saves  the  displayed  graph  to  a  file.  A  dialog  box  appears  requesting  both  the  name  of  the 
file  to  be  saved  and  the  directory  where  the  file  is  to  be  stored.  IMPORTANT:  Change  the  file 
name  to  a  name  other  than  the  current  file  name.  Be  aware  that  the  next  time  SWEEP  is  run, 
it  will  overwrite  the  default  file  name.  Therefore,  be  sure  to  use  a  unique  file  name  when  you 
save  a  graph. 

Notes:  If  you  attempt  to  save  a  graph  in  another  directory,  a  macro  error  may  occur.  Just 
rename  the  default  graph  name  and  elide  the  [OK]  button.  This  saves  the  graph  in  the 
current  directory  using  the  new  graph  name. 

•  Print  Preview.  Tb  print  the  displayed  graph,  select  Print  Preview  from  the  File  menu.  The 
standard  Microsoft  Excel  Print  Preview  window  appears. 

Note:  Attempting  to  print  when  no  printer  is  connected  may  result  in  a  macro  error.  If  this 
happens,  click  the  [Halt]  button  in  the  Macro  Error  message  box,  select  Quit  from  the 
File  menu,  and  restart  SWEEP. 

43.8  The  Modes  Menu 

The  Modes  menu  is  dimmed  until  you  exit  Mode  2. 

4.4  MODE  3— PLANNING  AID  OPERATION 

Mode  3 — Planning  Aid  helps  you  to  perform  a  statistical  quality  control  of  the  software  product  under 
development  and  provides  you  with  the  capability  to  set  goals  for  error  discovery.  SWEEP  provides 
you  with  a  corresponding  error  discovery  profile  on  a  phase-by-phase  basis.  You  either  enter  an  esti¬ 
mate  for  total  errors  injected  and  the  estimated  peak  location  of  the  distribution  over  the  phases,  or 
enter  an  estimate  for  total  errors  injected  and  the  latent  error  content.  You  can  enter  tolerances  for 
the  goals  you  set.  You  can  choose  either  a  percentage  of  the  errors/phase  or  a  signal-to-noise  approach 
to  total  errors  (see  Appendix  A). 

4.4.1  Mode  3  Input  Dialog  Box 

When  you  select  Mode  3 — Planning  Aid  from  the  Modes  menu,  the  Mode  3  Input  dialog  box  (see 
Figure  4-21)  is  displayed.  The  dialog  box  provides  the  following: 
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•  Activity  Set  Selection  pulldown  menu 

•  Activity  set  manipulation  buttons 

•  Input  Data  Options  radio  buttons 

•  Iblerance  Options  data  options 

•  Input  fields  for  Estimated  Tbtal  Injected  Errors 

•  [OK]  button 

•  [Cancel]  button 


Mode  3  -  Input 


-Rctivity  Set  Options - 

Rctiuity  Set  Selection: 

Number  of  Phases: 


Default 


Shorn 


if  .1 


Delete 


-Input  Data  Options - 

®  Estimate  for  Peak  Location  (T) 
Estimate  for  Latent  Error  Content  (L) 


-Tolerance  Options - 

O  Upper/Louier  Tolerances 
Q  Signal  to  Noise  Ratio 


-Data  Entry  Area - 

Estimated  Total  Injected  Errors: 


Peak  Location  (T): 

Upper  Tolerance  (%): 
Louse?  Tolerance  i%‘v. 
Signal  to  Noise  Ratio: 


Figure  4-21.  Mode  3  Input  Dialog  Box 

Thev4cn'vt(y  Set  Selection  pulldown  menu  lets  you  select  an  activity  set  to  use  in  the  estimate.  The, four  activity 
set  buttons — [Show],  [Create],  [Change],  and  [Delete] — let  you  create  and  manipulate  new  activity 
sets. 

The  Input  Data  Options  radio  buttons  letyou  select  whether  to  input  an  estimate  either  for  peaklocation 
of  errors  or  for  latent  error  contents.  (SWEEP  computes  an  estimate  for  whichever  value  you  do  not 
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input.  If  you  input  peak  location,  SWEEP  computes  latent  error  content,  and  if  you  input  latent  error 
content,  SWEEP  computes  peak  location.)  The  Input  fields  (in  the  Data  Entry  Area)  allowyou  to  enter 
values  for  Estimated  Ibtal  Injected  Errors.  Depending  on  which  Input  data  option  you  select,  enter 
either  Peak  Location  (T)  or  Latent  Error  (L). 

You  must  select  an  activity  set  name  from  the  pulldown  menu,  lb  create  a  new  activity  set,  click  the 
[Create]  button.  Entering  a  value  in  the  Estimated  Ibtal  Injected  Errors  field  is  mandatory;  the  value 
must  be  greater  than  zero.  Enter  a  value  for  either  the  peak  location  or  the  latent  error  content,  which¬ 
ever  is  displayed.  Select  the  appropriate  input  field  and  enter  a  value.  Values  for  peak  location  must 
be  between  0.5  and  the  number  of  phases  (m).  Values  for  latent  error  content  must  be  greater  than 
zero. 

Entering  the  tolerance  input  values  is  optional.  You  have  the  choice  of  either  selecting  a  tolerance  by 
inputting  an  upper  and  lower  tolerance,  based  on  a  percentage  of  the  nominal  value  calculated,  or 
using  a  signal-to-noise  ratio  approach.  You  must  select  either  Upper/Lower  Ibierances  or  Signal  to 
Noise  Ratio  using  the  Iblerance  Options  radio  buttons. 

If  you  select  Upper/Lower  Ibierances,  you  must  enter  values  for  the  Upper  Iblerance  (%)  and  Lower 
Iblerance  (%)  input  fields.  If  you  enter  values  for  these  fields,  they  must  be  between  0  and  100. 

If  you  select  Signal  to  Noise  Ratio,  enter  a  value  greater  than  or  equal  to  1.0  in  the  Signal  to  Noise 
Ratio  input  field. 

Norm.  If  you  do  not  enter  any  tolerance  values,  the  resultant  graph  will  show  the  upper  and  lower 
tolerances  equal  zero  in  the  Rayleigh  distribution  curve. 

Click  the  [OK]  button  to  compute  the  Rayleigh  distribution  curve  and  to  display  a  Rayleigh 
distribution  graph  based  on  the  inputs. 

Clide  the  [Cancel]  button  at  any  time  to  dose  the  dialof  sox  and  return  to  the  Main  window  displaying 
the  File  and  Modes  menu  bar. 

4.4.2  Create  Activity  Set 

Creating  activity  sets  allows  you  to  customize  both  the  number  of  phases  and  the  name  for  each  phase. 
Click  the  [Create]  button  in  the  Mode  3  Input  dialog  box  (see  Figure  4-21)  to  display  the  Create  Activ¬ 
ity  Set  dialog  box,  shown  in  Figure  4-17.  This  box  provides  you  with  the  input  fields  to  name  the  activity 
set  and  each  phase,  and  a  pulldown  menu  to  select  the  number  of  phases.  You  may  designate  from  six 
to  nine  phases.  Click  the  [OK]  button  to  verify  the  data  you  have  entered  and  to  create  the  new  activity 
set 

Example:  You  want  to  create  an  activity  set  with  six  phases.  Click  the  [Create]  button  and  enter  My 
Activity  Set  as  the  activity  set  name.  Select  6  from  the  Number  of  Phases  pulldown  menu 
and  then  enter  Preliminary  Design,  Detailed  Design,  Coding,  Unit  Ifest,  Integration  Ifest, 
and  System  Ifest  into  the  name  fields  for  the  first  six  phases,  using  the  <TAB>  or  <DOWN 
ARROW>  keys  to  move  the  input  fields.  Click  the  [OK]  button  to  create  this  activity  set. 

4.43  SHOwAcnvrrvSiiT 

You  can  view  the  phases  of  an  activity  set  by  selecting  the  activity  set  from  the  Activity  Set  Selection 
pulldown  menu  in  the  Mode  3  Input  dialog  box  (see  Figure  4-21  )and  clicking  the  [Show]  button.  This 
displays  the  phases  of  the  selected  activity  (see  Figure  4-22).  Click  the  [Done]  button  to  return  to  the 
Input  dialog  box  (see  Figure  4-22). 


Shorn  flctiuity  Set 


Activity  Set  Name:  Default 
Number  of  Phases:  6 
rPhose  Names - 


Phase  1:  Preliminary  Design 
Phase  2:  Detailed  Design 
Phase  3:  Coding 
Phase  4:  Unit  Test 
Phase  5:  Integration  Test 
Phase  6:  System  Test 


l 


Done 


3 


Figure  4-22.  Show  Activity  Set  Dialog  Be* 


4.4.4  Change  Activity  Set 

lb  change  the  phase  names  or  number  of  phases  in  an  activity  set  you  have  already  created,  elide  the 
[Change]  button  on  the  Input  dialog  box  (see  Figure  4-21).  This  displays  the  Change  Activity  Set  dialog 
box  (see  Figure  4-18).  From  this  box,  you  can  select  the  activity  set  to  be  modified,  change  the  number 
of  phases,  and  modify  the  phase  names.  You  cannot  change  the  activity  set  name  after  you  have  created 
the  activity. 

Example:  In  the  activity  set  you  created,  you  want  to  change  a  Phase  2  name  to  Design  and 
Prototyping.  Click  the  [Change]  button  in  the  Input  dialog  box.  On  the  Change  Activity 
dialog  box,  select  the  activity  set  you  want  to  modify  (My  Activity  Set)  from  the  Activity  Set 
Name  pulldown  menu.  Click  the  Phase  2  input  field  and  press  the  Delete  key  on  the 
keybo?  -1  to  erase  the  old  phase  name.  Then  enter  Design  and  Prototyping  and  click  the 
[OK]  button. 

4.4.5  Delete  Activity  Set 

lb  delete  an  activity  set,  click  the  [Delete]  button  on  the  Input  dialog  box  (see  Figure  4-21).  This 
displays  the  Delete  Activity  Set  dialog  box  (see  Figure  4-19).  Select  the  activity  you  want  from  the 
Activity  Set  Name  pulldown  menu  and  click  the  [OK]  button. 

4.4.6  Rayleigh  Distribution  Graph 

When  you  select  either  an  Input  Data  Option  or  Tblerance  Option  from  the  Mode  3  Input  dialog  box 
(see  Figure  4-21),  SWEEP  generates  a  graph  entitled  Error  Statistical  Quality  Control  Chart.  The 
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graph  can  display  two  types  of  data.  A  sample  Mode  3  graph  is  shown  in  Figure  4-23.  If  you  input  the 
estimated  peak  location,  the  estimated  latent  error  content  is  displayed;  if  you  input  an  estimated  la¬ 
tent  error  content,  the  graph  displays  the  estimated  peak  location  of  the  distribution.  When  the  graph 
is  displayed,  use  the  File  menu  operations  to  close,  save,  or  print  the  graph.  (See  Section  4.4.7.) 


Note:  If  you  have  chosen  total  errors  and  latent  errors,  and  the  number  of  latent  errors  is 
unacceptably  high,  SWEEP  will  indicate  this  by  displaying  a  peak  location  (T)  greater 
than  the  number  of  phases. 


4,4.7  The  File  Menu 

The  File  menu  offers  the  following  options: 

•  Quit.  Exits  both  the  application  and  Microsoft  Excel. 

•  Oose.  Closes  the  active  document  and  returns  to  the  previous  screen. 

•  Save.  Saves  the  displayed  document  to  a  file.  A  dialog  box  appears  requesting  both  the  name 
of  the  directory  to  be  saved  to  and  the  name  of  the  file  to  be  saved.  Change  the  file  name  from 
the  current  file  name,  otherwise  SWEEP  will  overwrite  this  file  the  next  time  it  is  run. 

Note:  If  you  attempt  to  save  a  data  sheet  in  another  directory,  a  macro  error  may  occur.  Just 
rename  the  default  file  name  and  click  the  [OK]  button.  This  saves  the  file  in  the 
current  directory. 
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•  Print  Preview.  Tb  print  the  displayed  data  sheet,  select  Print  Preview  from  the  File  menu.  The 
standard  Excel  Print  Preview  window  appears. 

Note.  Attempting  to  print  when  no  printer  is  connected  may  result  in  a  macro  error.  If  this 
happens,  dick  the  [Halt]  button  in  the  Macro  Error  message  box,  select  Quit  from  the 
Hie  menu,  and  restart  SWEEP. 

4.4.8  Tbs  Moras  Menu 

The  Modes  menu  is  dimmed  until  you  exit  the  mode. 


This  page  intentionally  left  blank. 


5.  CASE  STUDY 


5.1  OVERVIEW 

This  section  shows  how  you  can  use  SWEEP’S  time-based  mode!  (Mode  1)  to  help  you  to  answer 
certain  questions  of  practical  interest,  such  as:  How  many  problems  should  be  expected  to  be  found 
during  the  next  five  months  of  testing?  The  application  of  Mode  1  to  the  analysis  of  error  (or  defect 
or  problem)  detection  occurrence  data  is  illustrated  here  by  applying  it  to  data  from  a  software 
development  project  conducted  at  the  Raytheon  Company  (Rome  Air  Development  Center  1977). 
You  can  use  Mode  1  to  set  parameters  for  data  related  to  the  detection  of  problems  during  the  testing 
process.  Thus,  you  can  make  estimates  of  the  number  of  problems  expected  to  be  observed  later,  over 
a  period  of  time  that  you  specify.  You  can  also  estimate  the  number  of  time  intervals  required  to  detect 
(and  remove)  a  specified  percentage  of  the  total  number  of  defects  expected  to  be  injected  into  your 
software  product  during  its  development.  You  can  also  use  SWEEP  to  make  projections  of  the  outages 
of  an  overall  system  caused  by  software,  hardware,  or  procedures  (Gaffney,  Ferrando,  and  Jones 
1993). 

SJ,  DEFECT  OR  PROBLEM  DATA  ANALYSIS  USING  SWEEP 

Mode  1  can  help  you  to  address  issues  related  to  the  number  of  defects  or  problems  you  expect  to  find 
during  the  development  of  a  software  product.  Refer  to  Section  4  as  you  read  and  think  about  the 
material  in  this  subsection.  Section  4  shows  you  what  the  dialog  boxes  for  SWEEP  look  like,  thus 
indicating  where  the  values  of  various  parameters  that  you  enter,  or  that  are  produced  by  SWEEP,  are 
available  to  you.  This  case  study  (see  Sections  53  and  5.4)  illustrates  how  you  can  obtain  the  values 
for  some  of  these  parameters.  Mode  1  can  help  you  to  determine  the  answers  to  the  following 
questions: 

•  How  many  defects  should  I  expect  to  find  during  the  next  m  intervals  of  testing? 

You  may  wish  to  know  if  it  will  be  cost-effective  to  continue  testing  (and  finding  and  removing 
bugs)  for  some  period  of  time  in  additior  \c  time  your  organization  has  already  invested 
for  this  purpose.  An  increasing  amount  of  effort  will  probably  be  required  to  find  and  correct 
each  problem  found  in  the  software  product  as  the  test  process  continues.  This  is  a  manifesta¬ 
tion  of  the  law  of  diminishing  returns,  the  increasing  marginal  cost  for  identifying  problems 
during  the  period  of  testing  a  software  product  You  can  use  SWEEP  to  estimate  how  many 
defects  you  should  expect  to  find  in  testing  over  the  next  m  time  intervals.  This  information 
can  help  you  to  estimate  the  costs  of  discovering  defects  in  the  software  system  you  are  devel¬ 
oping.  If  it  costs  $X  per  time  increment  to  staff  the  testing  function,  then  it  will  cost  an  average 
of  SXJ B  per  problem,  if  B  problems  are  found  during  the  next  m  time  increments.  Generally 
expect  (and  SWEEP  can  estimate  for  you)  that  the  value  of  the  parameter  B  will  decrease  over 
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time,  after  the  peak  rate  of  defect  discovery  has  been  reached  (at  time  interval  tp).  Thus,  the 
average  cost  of  discovering  a  problem,  $XJB,  will  increase  over  that  period  of  time. 

•  How  long  should  I  expect  to  test  this  software  in  order  for  it  to  be  problem-free  (to  find 
essentially  all  of  the  errors  in  it)? 

You  might  find  it  useful  to  know  how  much  additional  test  time  is  (estimated  to  be)  required 
in  order  to  find  essentially  all  of  the  problems  in  the  software  product  being  tested.  Clearly, 
this  figure  is  likely  to  be  excessively  high,  but  it  is  an  upper  bound  to  the  required  test  time, 
and  you  might  find  it  of  interest.  SWEEP  tells  you  how  many  time  intervals  would  be  expected 
to  be  required,  in  total,  to  find  99.99  percent  of  the  (estimated)  total  number  of  errors  injected 
during  the  development  process  (the  total  number  that  are  potentially  discoverable,  the  pa¬ 
rameter  E,  the  (estimated)  total  number  of  errors,  as  described  in  detail  in  Appendix  A).  E 
is  one  of  the  two  parameters  of  the  Rayleigh  curve  that  SWEEP  fits  to  the  defect  or  problem 
discovery  data  that  you  provide  to  Mode  1.  The  other  parameter  is  the  location  of  the  peak 
of  this  fitted  curve,  tp;  the  peak  location  is  the  number  of  the  time  interval  at  which  the  highest 
rate  of  problem  discovery  is  estimated  to  be  experienced. 

•  How  many  defects  should  I  expect  to  be  injected  into  the  software  system  during  the 
integration  process?  What  is  the  profile  of  defect  or  problem  discovery  that  I  should  expect 
to  be  experienced  during  testing? 

Use  experience  on  past  projects  to  help  you  to  develop  your  initial  testing  budget.  Then,  after 
the  testing  process  has  been  underway  for  awhile,  review  this  budget  and  determine  if  it  needs 
to  be  revised— increased  or  decreased — in  accordance  with  the  actual  experience  on  this  proj¬ 
ect.  You  use  SWEEP  to  provide  data  you  can  use  in  developing  your  budget  update.  Use 
SWEEP  to  develop  a  projection  for  the  number  of  problems  or  defects  that  you  expect  to  be 
found  during  the  remaining  expected  test  time.  If  this  figure  is  higher  than  your  earlier  esti¬ 
mate,  it  would  probably  be  wise  for  you  to  revise  your  test  budget  upward,  based  on  the  in¬ 
creased  number  of  defects  that  you  now  expect  to  find.  Alternatively,  if  the  number  of 
problems  expected  is  lower  than  your  original  estimate,  you  may  wish  to  revise  your  test  budget 
downward.  SWEEP  provides  an  estimate  of  the  total  number  of  problems  that  you  should  ex¬ 
pect  to  find  during  the  remainder  of  the  scheduled  testing  period  (i.e.,  during  the  remaining 
m  time  intervals).  It  provides  an  estimate  of  the  number  of  problems  that  you  should  expect 
to  find  during  eadi  of  those  m  intervals  (the  remainder  of  the  defect  or  problem  detection  pro¬ 
file).  An  estimate  based  on  the  projection  provided  by  SWEEP  should  be  better  than  an  esti¬ 
mate  that  was  made  before  the  project  got  underway,  based  on  historical  data.  If  you  use 
Mode  1  to  provide  estimates  of  the  values  of  the  parameters  of  the  profile  of  problem  discov¬ 
ery  during  the  integration  process,  then  the  value  of  the  parameter  E,  the  (estimated)  total 
number  of  errors,  would  be  the  estimate  of  the  total  number  of  errors  or  defects  expected  to 
be  injected  into  the  software  product  during  the  integration  process. 

S3  CASE  STUDY  PROJECT  DESCRIPTION 

The  iTaytheon  Company  developed  project  Q.  The  integration  testing  for  project  Q  was  performed 
over  a  period  of  approximately  33  months.  During  that  time,  there  were  about  40  releases  of  the  prod¬ 
uct,  which  reflected  error  corrections,  design  changes,  and  improvements.  During  the  33-month  time 
interval,  1,198  problems  were  reported,  and  their  times  of  occurrence  were  recorded.  Thble  5-1 
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presents  the  number  (out  of  the  total  of  1,198)  of  defects  or  problems  discovered  during  each  of  the 
33  months.  This  table  also  presents  the  cumulative  number  of  problems  or  defects  discovered  from 
the  beginning  of  testing  through  each  month.  Project  Q  personnel  plotted  the  number  of  problems 
found  each  month.  They  noted  that  the  incidence  of  defect  discovery  as  a  function  of  time,  over  the 
33-month  period,  had  peaks  and  valleys.  Figure  5-1  is  a  plot  of  the  incidence  curve  produced  by 


Figure  5-1.  Error  Incidence  for  33  Months 


SWEEP.  This  curve  shows  these  peaks  and  valleys  as  illustrated  by  the  number  of  defects  discovered 
per  month  over  the  33-month  period.  This  data  is  also  provided  in  Thble  5-1.  Project  Q’s  analysis  of 
the  defect  discovery,  or  problem  incidence,  data  showed  that  the  peaks  occurred  at  the  times  of  release 
of  the  builds  when  several  problems  had  been  resolved.  The  valleys  occurred  when  the  testing  of 
particular  functions  took  place,  when  the  development  team  was  preparing  to  work  on  the  next 
release. 


5.4  APPLICATION  OF  SWEEP  TO  THE  RAYTHEON  PROJECT  DATA 

This  section  describes  the  results  of  applying  Mode  1  to  the  Raytheon  Project  Q  data  presented  in 
Thble  5-1.  Various  parameters  that  one  could  use  to  answer  questions  of  interest  to  project 
development  management,  such  as  those  given  in  Section  5.2,  are  described  here. 

The  counts  of  defects,  or  problems  found  in  each  of  the  33  months  were  provided  to  SWEEP,  which 
fit  a  Rayleigh  curve  to  them.  The  actual  numbers  of  problems  found  per  month  for  33  months  (see 
Thble  5-1)  and  SWEEP’S  fit  to  these  data  are  shown  graphically  in  Figure  5-1  and  in  tabular  form  in 
Figures  5-3  and  5-4.  The  cumulative  number  of  problems  found  through  each  month  from  1  to  33  (see 
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Tkble  5-1)  and  SWEEP’S  fit  to  these  data  are  shown  graphically  in  Figure  5-2  and  in  tabular  form  in 
Figures  5-3  and  5-4. 


Tkble  5-1.  Defect  Discovery  History 


Figure  5-2.  Cumulative  Number  of  Errors  for  33  Months 


Using  Figure  5-2  and  Thble  5-1,  we  can  summarize  results  of  applying  SWEEP  to  this  data.  SWEEP 
estimated  that: 

•  99.9  percent  of  the  total  number  of  defects  anticipated  to  be  injected  into  the  system  would 
be  discovered  by  month  49. 

•  The  total  number  of  estimated  defects  injected  into  the  software  was  1,250. 

•  There  would  be  52  defects  remaining  in  the  system  at  the  end  of  month  33.  The  number  of  de¬ 
fects  remaining  is  calculated  by  subtracting  the  actual  total  number  of  defects  found  (1,198) 
from  the  estimated  total  number  of  defects  (1,250).  Realistically,  it  is  not  very  likely  that  every 
problem  would  be  discovered. 

•  The  value  of  the  peak  location  was  interval  13. 

Figures  5-2  and  54  are  images  of  the  data  sheet  produced  by  SWEEP  when  it  processed  the  data  for 
33  months  of  problem  discovery.  Shown  are  the  actual  and  SWEEP-fit  problem  counts,  as  well  as 
cumulative  counts  on  a  month-by-month  basis.  Additionally,  the  cumulative  percentage  of  total 
problems  injected  on  a  month-by-month  basis  is  indicated. 


5-5 


Errors  Discovered  to  Dete:  1 198 
Total  Errors  Projected:  1 250.37 
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Errors  Discovered  to  Date:  1 !  98 


5.  Cue  Study 
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Figure  5-4.  Data  Sheet  for  Fit  to  33  Months  of  Data,  Part  2 


Next,  the  ability  of  Mode  1  to  make  defect  discovery  projections  was  investigated.  A  situation  in  which 
only  23  months  of  testing  had  been  done  was  simulated.  SWEEP  was  used  to  project  the  defect,  or 
problem,  discovery  for  10  more  months  (i.e.,  through  month  33),  hased  on  the  counts  of  problems 
found  during  months  1  through  23.  SWEEP  estimated  that: 


•  The  total  number  number  of  defects,  E,  injected  into  the  software  would  be  1,455  (  as  com¬ 
pared  with  the  1,250  estimated,  based  on  33  months  of  data— just  16.4  percent  more  than  the 
1,250  based  on  the  larger  data  set  of  33  points). 

•  The  peak  location  was  interval  15  (rather  than  13,  based  on  33  months  of  data). 

•  A  total  of 1,329  problems,  or  defects,  would  be  found  in  the  33-month  period.  The  actual  figure 
was  1,198.  Thus,  SWEEP  over-estimated  the  number  of  defects  to  be  found  over  the  33-month 
period,  based  on  23  months  of  data,  by  131 — or  10.9  percent 

The  actual  numbers  of  problems  found  per  month  for  23  months  (see  Thble  5-1)  and  SWEEP’S  fit  to 
these  data  are  shown  in  Figure  5-5.  The  cumulative  number  of  problems  found  through  each  month 
from  1  to  23  (see  Thble  5-1)  and  SWEEP’S  fit  to  these  data  are  shown  in  Figure  5-6. 


Figure  5-5.  Error  Incidence  for  23  Months 


Figures  5-7  and  5-8  are  images  of  the  data  sheet  produced  by  SWEEP  when  it  processed  the  data  for 
23  months  of  problem  discovery.  Shown  are  the  actual  and  SWEEP-fit  problem  counts,  as  well  as  cu¬ 
mulative  counts  on  a  month-by-month  basis.  Additionally,  the  cumulative  percentage  of  total  prob¬ 
lems  injected  on  a  month-by-month  basis  is  provided  on  the  data  sheet.  Note  from  the  data  presented 
in  Figure  5-8  that  99.6  percent  of  the  defects  injected  (estimated  to  be  1,455)  would  be  found  by  the 
end  of  month  49,  based  on  data  from  months  1  through  23. 
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Figure  5-6.  Cumulative  Number  of  Errors  for  23  Months 


5.5  CONCLUSION 

This  case  study  demonstrates  the  types  of  information  that  can  be  made  available  to  software 
development  projects  using  SWEEP.  Project  managers  can  use  the  estimated  number  of  defects  when 
planning  and  scheduling  testing  cycles  and  resource  levels.  The  total  number  of  defects  can  be  used 
to  monitor  progress  against  a  project’s  quality  goals.  The  study  also  shows  that  SWEEP  can  be  used 
to  refine  estimates  by  entering  actual  project  data. 

After  23  months  of  testing  and  repairing  defects,  SWEEP  predicted  the  number  of  defects  discovered 
and  repaired  after  33  months  within  11  percent  of  the  actual  number.  Also  at  23  months,  SWEEP  esti¬ 
mated  within  about  16  percent  the  total  number  of  injected  errors  to  be  found  after  33  months.  Finally, 
SWEEP  estimated  that  after  23  months  of  testing  and  development,  69  percent  of  the  defects  had  been 
removed,  and  that  after  49  months,  99.6  percent  of  the  defects  would  be  removed. 

After  33  months  of  testing  and  repairing  the  software,  SWEEP  predicted  that  95  percent  of  the  errors 
had  been  removed  and  that  after  49  months,  99.9  percent  of  the  defects  would  be  removed. 


Errors  Discovered  to  Dote:  1013 
Totel  Errors  Projected:  1 455.04 
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5.  Cmc  Study 
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APPENDIX:  SOFTWARE  ERROR  ESTIMATION 
MATHEMATICAL  MODELS 


A.1  OVERVIEW 

This  section  describes  the  mathematical  models  implemented  by  Modes  1, 2,  and  3  of  SWEEP.  Mode  1 
provides  time-based  error  estimation,  using  error  discovery  data  available  on  a  time  basis.  It  is  pre¬ 
ferred  that  the  time  variable  should  be  the  system  run  time  during  which  error  data  was  collected,  rath¬ 
er  than  calendar  time,  and  should  be  normalized  so  that  each  interval  covers  the  same  duration. 
SWEEP  fits  this  data  to  a  Rayleigh  curve.  Mode  2  provides  a  phase-based  approach  to  error  estima¬ 
tion,  fitting  available  data  on  a  phase-by-phase  basis  to  a  Rayleigh  curve.  This  mode  is  based  on  the 
user  providing  a  minimum  of  two  successive  (but  not  necessarily  contiguous)  development  phases. 
SWEEP  estimates  errors  for  the  remaining  phases  and  the  latent  error  content  Mode  3  lets  the  user 
set  objectives  for  error  discovery  ranges  based  on  user-selected  criteria.  It  serves  as  a  planning  aid  for 
forecasting  the  number  of  errors  you  expect  to  find  during  the  development  process. 

A.2  MODE  1— TIME-BASED  MODEL  APPROACH  TO  ERROR  ESTIMATION 


Mode  1  fits  a  Rayleigh  curve  to  the  counts  of  errors  discovered  over  time.  The  user  works  with 
currently  available  test  data.  The  data  used  is  the  number  of  errors  discovered  and  the  number  of  test 
intervals  executed  to  date.  The  user  must  normalize  the  error  data  so  that  each  interval  covers  approxi¬ 
mately  the  same  amount  of  test  time.  For  example,  if  the  time  interval  for  week  one  was  24  hours/day, 
but  the  time  intervals  for  weeks  two,  three,  and  four  were  each  8  hours/day,  the  data  for  weeks  two, 
three,  and  four  could  be  combined  into  a  virtual  week  two.  The  error  discovery  data  must  be  for  the 
same  unit  of  code  during  all  of  the  test  intervals  used. 


Let  U(I)  be  the  number  of  errors  that  you  have  found  for  the  interval  I  (I  =  1,  ...n,  nslOO).  You  must 
enter  data  for  at  least  two  intervals.  The  algorithm  calculates  the  parameter  A(I)  for  each  U(I)  accord¬ 
ing  to  whether  you  have  entered  a  value  for  U(I).  A(I) = 1  if  you  have  entered  a  value  for  U(I);  A(I) =0 
otherwise. 


The  following  is  a  description  of  the  nature  of  the  iterative  algorithm  that  determines  the  two 
parameters  of  the  Rayleigh  curve  (E  =  the  total  area,  and  tp  =  the  peak)  that  best  fit  the  data  (the 
U(I)  that  you  have  entered).  First  SWEEP  iterates  through  T  (the  trial  values  of  tp),  ranging  from  in¬ 
terval  number  0.5  in  increments  of  0.1  to  300,  or  2,996  iterations,  to  find  the  smallest  value  of  D(B), 
the  squared  error,  (the  square  of  the  difference  between  the  data  points  that  you  enter  and  the  values 
generated  by  SWEEP).  SWEEP  uses  the  following  formulas: 


and 


T  = 


1 

M 


A-l 


E(B)  = 


£a<I)  •  «W 
1-1 _ 

£A(I)  •  (e-^-^-e-B-i*) 
1-1 


DV(I,B)  =  E(B)  •  (c-^-J)1^11) 


D(B)  =  £a(I)  •  (U(I)-DV(I,B))2 
1-1 

Let  BMIN  be  the  value  of  B  that  produced  D(BMIN),  the  smallest  value  of  D(B)  over  the  range  of 
B.  Then,  the  best  estimate  of  tp,  denoted  as  TMIN,  is  given  by: 

TMIN  =  -rJ=r 

72BMIN 

Also,  E(B)  is  the  value  of  Rayleigh  E — the  total  errors  associated  with  the  trial  value  of  B.  Thus,  the 
best  estimate  of  E,  the  total  lifetime  errors,  symbolized  as  E(BMIN),  is  given  by: 

£a<I)  .  U(I) 

E(BMIN)  =  — - — - 

^  A(I)  •  (e-»**nNa-i)a  -  e-BMiN-i*) 

1-1 

SWEEP  allows  you  to  repetitively  calculate  Sm— the  estimated  number  of  errors  to  be  discovered 
during  the  next  (m)  time  intervals.  Do  this  if  you  want  to  know  how  many  errors  are  estimated  to  be 
discovered  during  the  next  m  intervals.  You  must  provide  this  (m)  value  for  SWEEP  to  perform  these 
calculations.  Sq  is  calculated  as  follows: 

Sm  =  ^E(BMIN)  •  |i_c-BMiN  (n+»)Jj j  _  |e(BMIN)  •  (l-e-BMm'nI)) 
or,  simplifying: 


Sm  =  E(BMIN)  •  |e_BMIN '  nJ-e-BMIN  •  (n+m)1  j 

SWEEP  also  allows  you  to  repetitively  calculate  k,  the  total  number  of  time  intervals  required  to  yield 
a  particular  percentage  (p)  of  the  total  number  of  errors  (E).  You  must  provide  this  (p)  value  for 
SWEEP  to  perform  these  calculations,  and  the  percentage  (p)  must  be  less  than  or  equal  to  99.99. 

SWEEP  will  also  calculate  r,  where  r=k-n,  the  number  of  additional  intervals  beyond  n,  required  to 
reach  p.  Remember  that  n  is  the  number  of  intervals  for  which  error  data  is  available. 

lb  calculate  r,  you  must  first  solve  k  using  the  following  equation: 
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P  _  /-i  „-BMIN-k*\ 

100  l1  I 


Rewriting  the  equation,  you  obtain: 


and: 


Therefore: 


Finally: 


»-BMIN-kJ  _  i _ P 


100 


-  BMIN  •  k2  =  loge(l  - 
(loge(!  -  ife)) 


-  BMIN 


/ 


(i°fe(i-ifo) 


-BMIN 


The  value  of  r  is  given  by  r=k-n,  where  r  is  the  additional  number  of  intervals  over  which  testing  must 
be  done  to  reach  (p)  percentage  of  (E),  the  total  number  of  errors  injected  during  development.  Hav¬ 
ing  solved  for  k,  SWEEP  can  now  calculate  S*,  the  estimated  total  number  of  errors  that  will  be 
discovered  through  time  interval  k. 

S*  is  determined  using: 

Sk  =  E{BMEN)  •  (l-e-BM*N  k*j 

Sr,  the  estimated  number  of  errors  that  will  be  identified  during  the  next  r  intervals  in  excess  of  Sq, 
the  estimated  errors  already  found,  is  determined  by: 

Sr  =  Sk-Sn 

Where  S„  is  determined  using: 

Sn  =  E(BMIN)  •  (l— e"BMIN  “2) 

Note  that  S„  is  an  estimated  value  that  may  differ  from  the  actual  number  of  errors  you  enter, 
especially  if  you  have  null  data  in  one  or  more  intervals. 

Thus,  Sr  becomes: 

Sr  =  E(BMIN)  •  ( e-BMIN  ■  n2_e~BMIN  •  (n+ r)J 'i 
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A3  MODE  2 — PHASE -BASED  MODEL  APPROACH  TO  ERROR  ESTIMATION 
AJ.l  Overview 

Mode  2  fits  a  Rayleigh  curve  to  error  discovery  count  data  obtained  on  a  phase-by-phase  basis.  It  fits 
a  Rayleigh  curve  to  the  numbers  of  errors/code  unit  discovered  during  each  phase  of  the  software  de¬ 
velopment  process.  The  model  requires  data  from  at  least  two  different  development  phases;  howev¬ 
er,  they  need  not  be  from  contiguous  phases.  The  model  am  estimate  the  errors  expected  to  be  found 
in  later  phases  based  on  number  of  errors  discovered  in  earlier  phases.  It  can  also  be  employed  to  esti¬ 
mate  the  latent  error  content— the  expected  number  of  errors  in  the  post-delivery  software.  For  the 
pre-test  phases,  the  error  counts  are  typically  found  during  inspections.  Using  this  mode  enables  you 
to  use  valuable  data  obtained  before  die  code  executes.  This  type  of  data  is  often  ignored  by  software 
analysts  and  quality  assurance  personnel  when  estimating  errors. 

A33  Rayleigh  Curve 

An  analysis  of  the  number  of  errors  discovered  at  successive  stages  of  the  software  development 
process  indicates  that  the  normalized  error  discovery  profile,  when  taken  on  a  phase-by-phase  basis, 
typically  increases  and  then  decreases  as  a  function  of  phase  as  you  move  through  the  process  (e.g., 
high-level  design  inspection  to  low-level  design  inspection).  It  is  monomodal  (has  a  single  peak)  in 
form.  Thus,  errors  per  code  unit  may  be  plotted  as  a  function  of  each  error  discovery  phase.  The  use 
of  errors  per  code  unit  allows  you  to  use  error  discovery  data  normalized  with  respect  to  the  actual 
code  size.  Hence,  you  can  employ  data  about  different  parts  of  a  software  product  in  developing  an 
error  profile,  assuming  a  reasonable  degree  of  homogeneity  across  its  parts  (modules  or  other 
elements  of  program  structure). 

One  representation  of  a  monomodal  model  is  the  Rayleigh  curve  (actually  a  discrete  or  histogram 
form  since  the  data  is  discrete)  depicted  in  Figure  A-l.  The  Rayleigh  curve  was  selected  as  the  model 
of  software  error  discovery  used  in  SWEEP  because  of  its  high  level  of  familiarity  within  the  software 
community.  It  is  widely  used  as  a  model  of  the  “proper”  application  of  labor  to  develop  products  in 
general  (Norden  1958;  Norden  1963)  and  more  particularly  for  the  software  development  process 
(Gaffney  1982)  as  well  as  the  entire  software  development  life  cycle  (Putnam  1978).  The  cumulative 
form  of  the  Rayleigh  curve,  as  applied  to  the  error  discovery  process  model  used  in  SWEEP,  is: 

V,  =  E(l  -  e“Bt2) 
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Development 
Phase  t 


Rayleigh  curve  fit:  AVt  —  Eje8**-1)2  —  e-B,2J 


E  11  Total  lifetime  error  rate  per  code  unit 


B- 


Defect  discovery  phase  constant,  the  location  of  the  peak  in  a  continuous  fit  to  the  data 


Rayleigh  curve,  cumulative  form: 
v,  -  E(l  -  e-*2) 


Figure  A-l.  "typical  Rayleigh  Distribution  Curve  for  a  Six-Phase  Development  Process 


where: 

Vt  =  Tbtal  number  of  errors  (or  errors  per  KSLOC)  discovered  through  development  phase 
(or  activity  number  t) 

E  =  Tbtal  lifetime  error  content  or  errors  injected  during  development — the  area  under 
the  Rayleigh  curve  between  t=0  and  t=oo 

B  =  Shape  parameter  of  the  distribution  curve 


where  tp=  error  discovery  phase  constant,  the  point  at  which  39%  of  E  errors  have  been  discovered. 

The  independent  variable  t  represents  the  error  discovery  activity  indexes,  as  illustrated  in  Figure  A-l 
for  the  case  of  six  error  discovery  activities.  SWEEP  allows  you  to  have  between  six  and  nine  phases. 


ArS 
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The  incremental  (phase-by-phase)  form  of  this  model  is: 

AV,  =  Eje^*'1)2  -  e_BtJj 

where  AVt  =  the  number  of  errors  (or  errors  per  KSLOC)  discovered  during  development  phase  t, 
such  as  high-level  design. 

The  number  of  latent  errors  L,  the  amount  of  errors  remaining  at  the  conclusion  of  development 
(including  testing)  process,  is  given  by: 


L  =  Ee'BM1 


where  M  is  the  number  of  error  discovery  phases  in  the  development  process.  If  M=  6,  then: 

L  =  Ee-36B 


The  efficiency  of  the  error  discovery  process,  EFF,  is  defined  as: 

(l-i)  x  100  or  x  100 


Thus,  for  the  case  of  six  cases: 


EFF  =  1  -  e'368 

Higher  efficiency  processes  have  larger  values  of  B  or  smaller  values  of  tpj  the  earlier  the  peak  is 
readied,  the  higher  the  efficiency  of  the  error  discovery  process. 

It  is  interesting  that  the  two  parameters  of  the  Rayleigh  curve,  the  location  of  the  peak  (tp)  and  the 
area  under  the  curve  (E),  correspond  to  mutually  exclusive  aspects  of  software  error  discovery.  E  cor¬ 
responds  to  the  quality  of  the  development  process,  which  relates  directly  to  the  area  under  the  curve. 
Poorer  processes  produce  (inject)  more  errors  and  have  higher  values  of  E.  Poor  verification  methods 
let  more  of  the  injected  errors  leak  to  latter  phases,  resulting  in  higher  values  of  tp,  which  is  the  location 
of  the  peak. 

A33  Nature  of  Fitting  the  Data  to  the  Rayleigh  Curve 

SWEEP  employs  an  iterative  procedure  to  fit  the  errors/code  unit  data  for  each  phase  of  available 
data.  It  produces  estimates  of  the  values  of  error/code  unit  for  each  phase  and  compares  them  with 
the  actual  values  that  you  enter.  The  algorithm  estimates  the  values  of  tp  ranging  from  phase  variable 
t=0.5  to  t=M  with  increments  of  0.1,  where  M  (6sMs9)  is  the  number  of  phases  in  the  discovery  pro¬ 
file  you  select  For  example,  the  ranges  of  values  of  phase  variable  t  for  die  six  phases  predefined  in 
the  SWEEP  tool  are: 

•  Values  of  0  to  1:  Phase  1 — Preliminary  design 

•  Values  of  1  to  2:  Phase  2 — Detailed  design 

•  Values  of  2  to  3:  Phase  3 — Coding 


M 
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•  Values  of  3  to  4:  Phase  4 — Unit  test  (CSU  testing  in  DOD-STD-2167A) 

(U.S.  Department  of  Defense  1988) 

•  Values  of  4  to  5:  Phase  5— Integration  test  (CSC  testing  in  DOD  -STD-2167A) 

•  Values  of  5  to  6:  Phase  6 — System  test  (CSCI  and  system  testing  in  DOD-STD-2167A) 

The  value  of  0.5  indicates  that  it  is  half-way  through  phase  one,  and  a  value  of  6  indicates  the  end  of 
phase  six.  The  Rayleigh  distribution  curve  shown  in  Figure  A-l  depicts  these  six  phases.  The  algorithm 
used  in  SWEEP  calculates  the  square  of  the  difference  between  the  actual  and  the  estimate  for  each 
of  the  data  points  you  enter.  The  minimum  value  for  this  estimate  corresponds  to  the  best  fit  for  the 
Rayleigh  approximation,  which  will  be  used  to  fit  the  data  you  provide  to  SWEEP. 

There  is  no  phase  dedicated  to  requirements  in  the  predefined  set  of  SWEEP  phases. 

Let  U(I)  be  the  number  of  errors  that  you  have  found  for  phase  I  (I  =  1 ,  ...M,  6sMs9).  You  must  enter 
data  for  at  least  two  phases  (not  necessarily  contiguous).  The  algorithm  calculates  the  parameter  A(I) 
for  each  U(I)  according  to  whether  you  have  enter?  1  a  value  for  U(I).  A(I)=1  if  you  have  entered  a 
value  for  U(I);  A(I)=0  otherwise. 

The  following  is  a  description  of  the  nature  of  the  iterative  algorithm  that  determines  the  two 
parameters  of  the  Rayleigh  curve  (E  =  the  total  area,  and  tp  =  the  peak)  that  best  fit  the  data  (the 
U(I)  that  you  have  entered).  First,  SWEEP  iteratives  through  phase  variable  t  (the  trial  values  of  tp), 
ranging  0.5,  in  increments  of  0.1,  to  M,  where  M  (6sMs9)  is  the  number  of  phases  in  the  discovery 
profile  you  select,  to  find  the  smallest  value  of  D(B),  the  squared  error,  using  the  following  formulas: 


and  t  =  -7= 

M 


E(B)  = 


f>D '  U(l) 

1-1 _ 

M 

]TA(I)  •  (e-B(i-i)J_e-B-iJ) 
i-i 


DV(I,  B)  =  E(B)  •  (e"B(I"1)2-e"BIi) 

M 

D(B)=  £a<I)-(U(I)-DV(I,B))2 
1-1 

Let  BMIN  be  the  value  of  B  that  produced  D(BMIN),  the  smallest  value  of  D(B).  Then,  the  best 
estimate  of  tp,  denoted  as  TMIN,  is  given  by: 

TMIN  =  .  -  1  - 
/2BMIN 

Also,  E(B)  is  the  value  of  Rayleigh  E — the  total  errors  associated  with  the  trial  value  of  B.  Thus,  the 
best  estimate  of  E,  the  total  lifetime  errors,  symbolized  as  E(BMIN),  is  given  by: 
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£a(I)'U(1) 

E(BMIN)  =  — - - 

j^A(I)  •  (e-BMIN(I-l)’  _  e-BMIN  P) 
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A.4  MODE  3 — PLANNING  AID 
A.4.1  Overview 

Mode  3  allows  you  to  set  objectives  for  error  discovery  ranges  based  on  user-specified  criteria.  Mode 
3  uses  a  phase-based  model  based  on  a  Rayleigh  curve  to  estimate  latent  error  content  and  the  peak 
location  of  errors. 

A.4.2  Software  Statistical  Quality  Control 

Some  of  the  basic  elements  of  software  statistical  quality  control  that  are  directly  related  to  Mode  3 
are: 


•  Using  graphs  to  present  goals  and  actual  performance  at  each  phase  of  the  software 
development  process.  Note  that  the  software  process  activities,  such  as  preliminary  design,  are 
analogous  to  the  stations  of  a  manufacturing  process. 

•  Plotting  error  discovery  versus  time  or  equivalent  (development  phase  number). 

•  Defining  a  control  (tolerance)  band  that  establishes  a  tolerance  range  for  each  error  discovery 
phase;  the  tolerance  range  relates  to  the  control  limits  of  your  quality  control  process. 

•  Using  the  graphs  to  indicate  departures  from  satisfactory  (quality-level)  behavior  and  to 
anticipate  and  help  forecast  departures  from  it. 

By  using  the  planning  mode  with  the  aid  of  the  graphics  it  provides,  you  can  employ  software  product 
data  to  make  inferences  about  the  development  process  used  to  create  the  product,  both  during  devel¬ 
opment  and  after  it  has  been  completed.  By  monitoring  the  product,  you  can  improve  your  control  of 
the  software  development  process.  Such  data  can  be  used  in  your  decision  process  and  action  strategy 
to  support  modifying  the  product  under  development  while  you  are  creating  it,  the  process  you  employ 
to  produce  it,  or,  if  post  facto  analysis  is  employed,  the  process  after  you  have  created  it.  The  Mode  3 
planning  aid  provides  developers  with  data  in  a  timely  manner  to  aid  them  in  discovering  departures 
from  the  development  process  and  supports  returning  it  to  control.  You  can  use  the  information  dis¬ 
played  in  the  planning  mode  graphs  to  prompt  corrective  action. 

The  planning  aid  provides  an  alternative  to  establishing  tolerance  profiles  and  is  based  on  the  work 
done  by  Genichi  Thguchi  (Barker  1990).  Thguchi  has  focused  on  improving  the  quality  of  measuring 
systems  and  measuring  procedures  by  introducing  the  concept  of  sensitivity  (Cherng  1989).  Thguchi 
suggests  using  the  concept  of  signal-to-noise  ratio  to  evaluate  the  sensitivity  of  a  measurement  system. 
The  concept  of  signal-to-noise  ratio  is  widely  applied  by  electrical  engineers.  The  basic  concept  is  that 
there  is  a  desired  signal  value,  G,  and  it  is  embedded  in  noise  whose  power  is  o2,  corresponding  to 
statistical  variance.  Then  S,  the  signal-to-noise  ratio,  is  equal  to  G /  o.  The  inverse  of  this  value  is  known 
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to  statisticians  as  the  coefficient  of  variation,  when  G  is  taken  as  the  expected  value  of  a  random  vari¬ 
able.  Larger  values  of  signal-to-noise  ratio  correspond  to  smaller  values  of  the  error  tolerance  range. 
It  does  not  appear  that  these  ideas  have  been  applied  to  software  before  the  application  in  SWEEP 
as  described  here. 

A.4J  Equations  for  the  Error  Discovery  Profiles  Displayed 

The  primary  equation  in  the  planning  aid  is  the  same  one  used  in  the  phase-based  model.  Listed  below 
are  the  primary  inputs  required  for  using  Mode  3. 

There  are  two  ways  to  establish  the  goal  error  discovery  profile  pattern.  You  can  enter: 

•  E,  the  desired  total  error  content  (total  injected),  and  L,  the  desired  latent  error  content 

•  E,  the  desired  total  error  content  (total  injected),  and  tp,  the  desired  location  of  peak  error 
discovery 

There  are  two  ways  to  establish  the  (optional)  tolerance  profile  patterns.  You  can  enter: 

•  pu,  the  desired  percentage  upper  tolerance  level,  and  pi,  the  desired  percentage  lower  tolerance  level 

•  S,  the  desired  value  of  the  signal-to-noise  ratio 

The  equation  for  the  goal  error  profile  (Gt)  value  for  phase  t  is: 

Gt  =  E[e-B(t_1)2-e-Bt2] 

The  equations  for  the  upper  (Ut )  and  lower  (Lt )  error  profile  values  for  phase  t  are: 

U,  =  E  •  (1  +  pu  •  0.01)  •  |e~B(t_l)J-e~Bt2] 
and 

L,  =  E  •  (1  -  p,  •  0.01)  •  [e-B(t-1)2-e~Bl2] 

When  you  enter  the  desired  signal-to-noise  value  S,  the  program  sets  Pu=Pi=p,  and  p=100/S. 
Then, 


and 


U,  =  1}-  •  [e-B(t-1)2-e“Bt2] 


Lt  =  •  [e-^-^-e-8'2] 


Note  that  S  must  be  greater  than  1  for  these  equations  to  hold. 
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GLOSSARY 


Code  execution 
Code  inspection 
Cumulative  curve 

Defect 

Defect  density 
Defect  discovery 
Defect  discovery  profile 

Error 


Error  detection 


When  code  has  the  ability  to  demonstrate  the 
functionality  of  the  particular  unit  of  code. 

Peer  review  of  a  programmer’s  code  to  find  problems 
and  improve  quality. 

A  graphical  representation  of  a  Rayleigh  distribution 
curve.  The  Y,  or  vertical,  axis  represents  the  total 
number  of  errors  discovered,  and  the  X,  or 
horizontal,  axis  represents  successive  test  intervals. 

See  Error. 

The  ratio  of  the  number  of  defects  to  the  amount  of 
code  in  which  they  were  discovered. 

The  finding  of  defects  in  a  software  element  during 
a  verification  activity. 

A  plot  of  the  number  of  defects  (errors)  found  in  each 
development  phase  or  in  each  time  interval,  over  all 
of  the  development  phases  or  test  time  intervals, 
respectively. 

The  term  error  is  summarized  and  defined  using  the 
following  four  definitions: 

Failure:  A  manifestation  of  an  inability  of  the 
software  product  to  conform  to  requirements. 

Fault:  An  error  in  software  that  could  cause  it  to 
produce  an  incorrect  result  or  an  invalid  output. 

Defect:  The  evidence  of  the  existence  of  a  fault. 

Error:  The  human  action  of  commission  or  omission 
that  results  in  the  software  containing  a  fault.  See 
also :  Injected  error. 

The  method  or  practice  used  to  discover  an  error 
during  software  development.  Same  as  error 
removal. 


Gk>l 


Error  discovery  profile 


Error  discovery  rate 
Error  prevention 
Expansion  ratio 

Fault 

Failure 

Incidence  curve 

Injected  error 
Latent  error  content 
Macro  error 

Metrics 

Peak  location 
Radio  buttons 

Rayleigh  distribution  curve 


A  plot  of  the  number  of  errors  (defects)  found  in  each 
development  phase  or  in  each  time  interval,  over  all 
of  the  development  phases  or  test  time  intervals, 
respectively. 

The  rate  at  which  errors  are  discovered  during  a 
given  time  interval  or  development  phase. 

The  process  of  detecting,  fixing,  and  preventing 
errors  from  reoccurring. 

The  ratio  of  the  number  of  statements  of  code  to  the 
number  of  statements  of  design  from  which  they  were 
derived. 

See  Error. 

See  Error. 

Agraphical  representation  of  a  Rayleigh  distribution 
curve  for  a  particular  interval.  The  Y,  or  vertical,  axis 
represents  the  number  of  errors  discovered  in  the 
interval,  and  the  X,  or  horizontal,  axis  represents  one 
or  more  data  groupings  for  the  interval. 

An  error  introduced  into  the  software  during  any 
phase  of  software  development. 

The  number  of  errors  remaining  in  the  software  at 
the  time  the  software  is  delivered  to  the  customer. 

When  running  SWEEP  an  EXCEL  macro 
instruction  fails,  and  an  indication  of  this  is  provided 
to  the  user. 

Quantities  indicative  of  some  aspect  of  a  software 
product,  such  as  the  number  of  defects  found  during 
the  preliminary  design  activity  or  during  the  period 
of  testing. 

The  number  of  the  time  interval  or  phase  of  the  peak 
of  the  Rayleigh  fit  to  the  error  or  defect  discovery. 

User  selectable  buttons  on  SWEEP  dialog  boxes  that 
have  the  form  of  a  small  circle  with  a  dot  in  the 
middle. 

The  Rayleigh  curve  fit  to  error  or  defect  discovery 
data  produced  by  the  SWEEP  models. 


Glo-2 


GhgiMy 


Signal-to-noisc  ratio 


Squared  error 


Tblerance  band 


Verification 


The  ratio  of  the  amplitudes  of  a  signal  and  the  noise 
in  which  in  which  it  is  imbedded.  Used  in  SWEEP  to 
indicate  the  ratio  of  the  desired  value  of  error 
discovery  in  some  particular  phase  and  the  size  of  the 
tolerance  about  it. 

The  sum  of  the  squares  of  the  differences  between 
the  error  count  data  entered  into  SWEEP  by  a  user 
and  the  counts  that  SWEEP  generates  as  part  of  the 
algorithm  SWEEP  employs  to  fit  a  Rayleigh  curve  to 
the  user-entered  data. 

The  user  selected  range  about  a  desired  value  or  goal 
for  defect  discovery  in  each  phase  of  the  discovery 
profile. 

The  process  of  determining  whether  a 
representation  of  a  software  system,  such  as  detailed 
design,  is  a  correct  elaboration  of  the  requirements. 
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LIST  OF  ABBREVIATIONS  AND  ACRONYMS 


CMM 

CSC 

csa 

csu 

ESP 

KSLOC 

PC 

RAM 

SEI 

SWEEP 


Capability  Maturity  Model 
computer  software  component 
computer  software  configuration  item 
computer  software  unit 
Evolutionary  Spiral  Process 
thousand  source  lines  of  code 
personal  computer 
random  access  memory 
Software  Engineering  Institute 
Software  Error  Estimation  Program 
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